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ABSTRACT 


A  mathematical  expression  for  combining  the  entire 
failure  rate  curve  is  derived  based  on  the  assumption  that 
the  failure  population  is  composed  of  three  subpopulations, 
early,  chance,  and  wearout.  A  graphical  method  is  provided 
for  separating  the  subpopulations  and  determining  the  para¬ 
meters  of  the  model.  The  expression  is  then  applied  to 
observed  failure  data  in  three  detailed  examples  and  in 
each  case  the  model  is  shown  to  represent  the  observed  data 
at  the  .05  significance  level  using  the  Kolmogorov- Smirnov 
Test.  Two  BASIC  language  computer  programs  are  provided  to 
simplify  the  use  of  the  proposed  model.  The  proposed  ex¬ 
pression  is  compared  with  techniques  presently  used  to  model 
failure  data  and  is  shown  to  be  superior  in  three  ways: 

1.  It  is  more  accurate  than  methods  presently  in  use. 

2.  It's  greater  flexibility  permits  the  modeling  of 
data  which  is  beyond  the  capabilities  of  present  failure 
modeling  techniques. 

3.  The  proposed  model  yields  essential  information 
for  managers  as  well  as  theoreticians  concerning  failure 
periods  and  underlying  failure  causes,  information  which 
is  obscured  by  present  modeling  methods. 
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CHAPTER  I 


INTRODUCTION 

This  study  has  two  purposes.  The  primary  purpose 
is  to  propose  a  general  mathematical  model,  derived  from 
reliability  theory,  which  will  predict  military  electronic 
equipment  component  failure  rates  more  precisely  than  cur¬ 
rent  models.  A  subsidiary  purpose  is  to  test  the  hypothesis 
that  the  proposed  model  will,  as  a  result  of  the  nature  of 
its  derivation,  provide  a  mathematical  means  of  isolating 
the  underlying  causes  of  component  failure  as  a  possible 
alternative  to  the  costly  and  time  consuming  technique  of 
physically  examining  each  failed  item. 

Reliability  may  be  defined  as,  "The  probability  of  a 
device  performing  its  mission  adequately  for  the  period  of 
time  intended  under  the  operating  conditions  expected  to  be 
encountered"  (1).  Equipment  reliability  is  one  of  the  bases 
for  procurement  of  military  hardware  and  is  a  central  con¬ 
sideration  for  the  allocation  of  maintenance  resources.  The 
reliability  of  a  new  item  of  military  electronic  equipment 
is  a  function  of  the  individual  failure  rates  of  its  compon¬ 
ent  parts.  Hence,  a  more  precise  component  failure  rate 
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model  would  produce  a  resultant  increase  in  the  accuracy  of 
end  item  reliability  predictions. 

The  theoretical  basis  for  current  failure  rate 
models  must  be  considered  before  fully  understanding  why 
there  is  still  room  for  improving  model  accuracy.  Funda¬ 
mental  reliability  theory  indicates  that  three  distinct 
failure  phenomena  occur  during  the  operational  life  of 
components  of  military  electronic  equipment.  These  pheno¬ 
mena  are  infantile,  deterioration,  and  chance  failures.  In¬ 
fantile  failures  occur  early  in  component  life  and  normally 
components  failing  in  this  mode  are  considered  to  have  been 
substandard  prior  to  operation.  Deterioration  failures 
occur  late  in  component  life,  during  the  wearout  period, 
and  are  the  result  of  physical  or  chemical  aging.  Chance 
failures  are  caused  by  randomly  occurring  stresses  that  ex¬ 
ceed  component  strength.  These  failures  are  not  a  function 
of  component  age,  but  are  as  likely  to  occur  in  early  life 
as  in  the  wearout  period.  Chance  failures  are  the  only 
failure  phenomena  to  occur  during  the  period  of  component 
age  between  the  end  of  early  life  and  the  beginning  of 
wearout.  It  is  during  this  period  that  it  is  most  economical 
to  operate  components  and  therefore  the  period  has  been 
named  component  useful  life.  Figure  1.1  is  a  graphical 
representation  of  failure  rate  versus  component  age.  It  is 
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noteworthy  that  each  of  the  three  failure  periods  exhibits 
mi  individual  failure  rate  characteristic  which  is  decreas¬ 
ing  during  early  life,  constant  during  useful  life,  and  in¬ 
creasing  during  wearout.  The  result  of  the  combination  of 
these  three  failure  rate  periods  is  the  nonmonotonic  failure 
rate  curve  or  more  commonly,  the  bathtub  curve  shown  in 
Figure  1.1. 

Although  the  failure  behavior  described  above  is 
widely  accepted  in  Reliability  Engineering  literature  and 
provides  the  basis  for  much  of  classic  Reliability  Theory, 
very  little  has  been  done  to  combine  the  entire  failure  rate 
curve  mathematically  by  the  use  of  a  unified  expression. 
Instead  the  failure  population  is  usually  assumed  to  exhibit 
only  one  or  possibly  two  of  the  three  failure  rate  periods. 
There  are  three  methods  by  which  this  assumption  is  applied: 

1.  The  early  failure  period  may  be  neglected  by 
assuming  that  the  population  is  fully  burned- in  or  debugged 
prior  to  the  analysis  of  failure  data.  The  failure  data  may 
then  be  modeled  by  a  statistical  distribution  which  exhibits 
a  monotonically  increasing  failure  rate. 

2.  The  wearout  failure  period  may  be  neglected  by 
truncating  the  analysis  of  failure  data  prior  to  the  onset  of 
wearout.  The  failure  data  may  then  be  modeled  by  a  statist¬ 
ical  distribution  which  exhibits  a  monotonically  decreasing 
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failure  rate. 

3.  Both  early  and  wearout  periods  may  be  neglected 
by  combining  procedures  1  and  2  above.  In  this  case  the 
failure  rate  is  assumed  to  be  constant  and  may  be  associated 
with  either  increasing  or  decreasing  monotonic  failure  rate 
distributions. 

This  method  of  replacing  a  complex  nonmonotonic 
problem  with  a  simpler  monotonic  sub-problem  is  the  present 
basis  for  the  analysis  of  military  electronic  equipment  com¬ 
ponent  failures.  The  obvious  difficulty  with  this  approach 
is  that  it  does  not  address  the  reality  that  many  component 
populations  do,  in  fact,  demonstrate  all  three  failure 
periods  just  as  depicted  in  Figure  1.1.  Consequently,  by 
making  a  simplifying  assumption,  part  of  the  problem  itself 
has  been  assumed  away.  Data,  so  analyzed  are  not  fully 
productive  and  potentially  useful  information  concerning 
failure  cause  is  unnecessarily  lost. 

It  is  proposed  that  a  unified  expression  capable  of 
combining  the  entire  failure  rate  curve  would  overcome  the 
difficulties  of  present  failure  rate  models.  It  is  further 
hypothesized  that  such  a  model  would  provide  more  accurate 
predictions  and  that  additional  information  gleaned  from 
present  raw  failure  data  by  the  proposed  model  would  be 
useful  in  isolating  the  under lying  causes  of  component 
failure.  The  proposed  model  will,  in  all  likelihood,  be 
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more  complex  than  are  current  models,  but  with  the  present 
availability  of  high  speed  digital  computers  this  is  no 
longer  an  important  disadvantage. 

The  thesis  is  organized  such  that  the  early  chapters 
review  fundamental  reliability  theory  and  provide  a  basis 
for  the  latter  applicatory  chapters.  Chapters  2  through  6 
and  parts  of  Chapters  7  and  8  have  been  extracted  from  a 
previous  report  by  the  author  (2).  Portions  of  Chapters  4, 

6  and  8  also  appear  in  a  paper,  co-authored  by  Kececioglu, 
published  in  the  1971  Transactions  of  the  American  Society 
for  Quality  Control  (3). 
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CHAPTER  2 


FUNCTIONS  USED  IN  LIFE  TESTING 

2.1  Introduction 

If,  during  a  nonreplacement  life  test  of  N  like 
items,  after  a  time  t,  N  (t)  items  have  survived,  we  esti- 

S 

mate  the  reliability  of  any  one  item  at  time  t  to  be  the 
ratio  of  surviving  items  at  time  t  to  the  total  number  of 
items  in  test. 

N  ,  v 

R(t)  -  'gW  (2.1) 

Similarly,  the  unreliability  is  estimated  as  the  ratio  of 
the  cumulative  number  of  items  failing  by  time  t  to  the 
total  number  of  items  in  test. 

Nf(t) 

Q(t)  -  -jj -  (2.2) 

Since 

N  -  Nf(t)  +  Ng(t)  (2.3) 


We  may  write 
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R(t)  +  Q ( t )  =  1  (2.4) 

These  expressions  are  the  fundamental  definitions  of  reli¬ 
ability  theory.  Next  we  shall  examine  some  basic  functions. 

2.2  Probability  Density  Function 

In  life  testing,  failure  records  are  maintained  on 
the  items  in  test.  The  fraction  of  the  total  items  in  test 
which  fail  during  time  At,  between  t  and  fc  +  At,  may  be 
plotted  against  time,  to  obtain  a  failure  histogram.  If 
the  time  increment, At,  is  allowed  to  approach  zero  as  a 
limit  as  N  becomes  very  large,  the  histogram  approaches  a 
smooth  curve,  called  the  probability  density  function,  f(t). 
Probability  density  functions  for  three  specific  distribu¬ 
tions  are  plotted  versus  time  in  Figs.  3.1c,  3.2c,  and  3.3c. 
Mathematically,  f(t)  is  written 

d  MO 

f(t)  "  h  dt  (2.5) 

The  area  under  f(t)  represents  the  cumulative  failures  per 
,  component  in  test,  which  has  previously  been  defined  as  the 

probability  of  failure  or  unreliability,  Q(t).  If  f(t)  is 
i  integrated  over  the  limits  to  +  •»  ,  the  cumulative 

number  of  items  failing  will  become  equal  to  the  total  num¬ 
ber  of  items  in  test  and,  therefore, 


By  the  definition  of  f(t), 


f(t)  >  0  (2.7) 

for  -«  <  t  <o*. 

Equations  (2.6)  and  (2.7)  are  the  necessary  and  sufficient 
conditions  for  a  function  to  be  considered  a  probability 
density  function  (4,  p.  10). 

The  area  under  the  f(t)  curve  to  the  left  of  time  t 
represents  the  cumulative  failures  per  component  which  have 
occurred  prior  to  t;  therefore, 


St 

-M 

where  |  is  a  dummy  variable  of  integration.  Since 
R(t)  ■  l-Q(t),  equation  (2.8)  may  be  written 


R(t) 


(2.8) 


(2.9) 


From  equations  (2.6)  and  (2.9)  it  follows  that 


R(t)  - 


which  reduces  to 


(2.10) 


(2.11) 
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Since,  by  definition  f(«o)  ■  0,  we  find  by  differentiation 
that 


* 


«t)  -  - 


(2.12) 


2.3  Instantaneous  Failure  Rate  Function 

It  is  useful  in  life  testing  to  construct  a  failure 
rate  histogram  similar  to  the  failure  histogram  constructed 
in  Section  2.2,  but  with  an  all  important  difference.  In 
this  case  we  plot  the  fraction  of  the  number  of  items  sur¬ 
viving  at  t  which  fail  during  At.  If  the  incremental  time, 
At,  approaches  zero  as  a  limit  as  N  becomes  very  large, 
the  histogram  approaches  a  smooth  curve  called  the  instan¬ 
taneous  failure  rate  function,  ^(t) .  Other  names  in  common 
usage  for  this  function  are  hazard  rate,  force  of  mortality, 
and  failure  rate.  The  failure  rate  versus  time  is  plotted 
for  three  specific  distributions  in  Figs.  3.1a,  3.2a,  and 
3.3a.  Mathematically,  the  failure  rate  is  written 


*<t) 


d  Nf(t) 
— dt 


(2.13) 


Multiplying  and  dividing  equation  (2.13)  by  N  gives 


A(t) 


N  d[Nf  <»)/"] 

N  (t)  dt 
s 


(2.14) 


1  dLR(t 

■"l)  '  -  RTcT  Tt 


From  equation  (,2.12) 


-  dlsiiD.  f(t) 

at 


(2.15) 


therefore, 


*(t)  ■  IB} 


(2.16) 


2.4  Reliability  Function 

Equation  (2.15)  is  an  ordinary  differential  equa¬ 
tion  which  can  be  solved  for  R(t).  Rearranging  (2.15) 


dfR(t)]  -  -  fc(t)  dt 


Integrating 


sNff-~£ 


A<f)df 


which  yields 


In  R(f) 


C  ■  ■£ 


If  it  is  assumed  that  at  t.  -  0  no  items  have  failed,  the 
initial  condition  R(0)  ■  1  is  obtained.  Therefore, 


J2 


R(t)  -  e~  (2.17) 

2 . 5  Mission  Reliability  Function 

Mission  reliability  is  defined  as  the  probability 
of  survival  of  an  item  during  a  mission  of  duration  T, 
given  that  the  item  or  component  has  not  failed  prior  to 
the  start  of  the  mission  at  time  t. 

Bazovsky  (5,  p.  44)  defines  the  a  posteriori  prob¬ 
ability  of  failure  as 

Q(t,  T)  -  (?.18) 

Mission  reliability  is  the  complement  of  Q(t,  T), 

R(t,  T)  »  1  -  (2.19) 

which  may  be  written 

R(t.  T)  -  i  T At 

or  after  simplifying 

R(t,  T)  -  -^ty (2.20) 

2.6  Summary 

It  is  evident  that  the  functions  derived  in  this 


section  are  related  to  each  other  in  such  a  way  chat  if  an 
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expression  is  known  for  one  function,  the  expression  for  the 
other  functions  can  be  found  directly. 

Thus  far,  the  discussion  has  not  been  based  on  any 
particular  distribution.  The  equations  derived  in  this 
section  are  always  true  regardless  of  the  failure  mechanism 
or  type  of  failure  distribution  involved.  Chapter  3  will  be 
concerned  with  some  of  the  more  common  distributions  which 
have  been  used  to  represent  specific  types  of  failure. 
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2. 7  Symbols  Introduced  in  Chapter  2 

f(t)  Probability  density  function 

in  Natural  I  ogari  thin 

N  Total  number  of  items  in  test 

N^( t)  Cumulative  number  of  items  failing  by  time  t 

N  (t)  Number  of  items  surviving  at  time  T 

Q(t)  Unreliability  function 

R(t)  Reliability  function 

R(t,  T)  Mission  reliability  function 

T  Mission  duration 

t  Time  variable 

At  Time  increment 

J\(t)  Instantaneous  failure  rate  function 

5  Dummy  variable  of  integration 


CHAPTER  3 


DISTRIBUTIONS  USED  IN  LIFE  TESTING 


3 . 1  The  Exponential  Distribution 

The  exponential  distribution  has  proven  to  be  a 
representative  model  for  the  failure  behavior  of  many 
electrical  components  and  also  for  electrical  and  non¬ 
electrical  complex  systems  (6,  pp.  113-150).  In  re¬ 
liability  analysis  the  exponential  distribution  is  commonly 
used  to  evaluate  life  test  data  in  which  failures  occur 
randomly.  The  distribution  is  characterized  by  a  constant 
instantaneous  failure  rate  (Fig.  3.1a). 

A(t)  38  CONSTANT  (3.1) 


For  the  exponential  distribution,  the  mean  time  between 
failures  is  simply 


The  reliability  function  (Fig.  3.1b)  is  found  from  equation 
(2.17), 


0  <  t  $  +  00 


R(t)  -  e 
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For  constant  ^ 


-At 

R(t)  ■  e  0  *  +  p 


(3.3) 


The  probability  density  function  (Fig.  3,’c)  is  found  by  the 
use  of  equations  (2.16)  and  (3,3)  to  be 

-At 

f(t)  t  £  0  (3.4) 


A  preliminary  method  of  determining  if  failure  data 
follow  the  exponential  distribution  is  to  plot  the  median 
rank,  which  is  similar  in  concept  to  the  cumulative  percent 
failing,  versus  time  on  logarithmic  paper  (Ref.  7,  p.  2). 

The  median  rank,  M.R.,  in  percent  is  defined  as 

Nf (t)-0. 3 

M-p-  "  lb  o."4  (100)  (3-5) 

If  the  plotted  points  closely  approximate  a  straight 
line  the  data  are  assumed  to  be  exponentially  distributed. 
However,  since  there  is  no  measure  of  the  accuracy  of  this 
graphical  method,  a  goodness-of-f it  test  such  as  the  Chi- 
Square  or  the  Kolmogorov- Smirnov  test  should  be  used  as  veri 
fication.  If  the  data  do  not  fall  on  a  straight  line  a  ¥ 
correction  can  be  applied  as  described  in  Sec.  3.3  to 
straighten  the  curve.  If  the  JT  correction  does  not  produce 
the  desired  result,  other  probability  papers  should  be  tried 


3.2  The  Normal  Distribution 


18 


liisloirical  !y,  much  of  Lhe  curly  work  with  Lhe  normal 
distribution  concerned  heights  and  weights  of  humans  and  ani¬ 
mals,  crop  yields  for  different  soils  and  localities,  and 
student  examination  scores  (8,  p.  8).  Unlike  an  exponent¬ 
ially  distributed  population  which  suffers  its  greatest 
losses  prior  to  the  mean  time  T,  the  normally  distributed 
population  suffers  its  greatest  loss  around  the  mean  time  T. 
In  life  testing  it  has  been  observed  that  failures  of  the 
wearout  type  are  often  normally  distributed  about  a  mean 
wear out  age. 

The  normal  probability  density  function  (Fig.  3.2c) 


is 


-  ±(^>2 
2'  9  ) 


-00  <  t  ^  00 


(3.6) 


f(t) 


"■{an 


where  T  is  the  mean  time  to  failure. 


(3.7) 


and  9  is  the  standard  deviation 


The  reliability  function  (Fig.  3.2b)  is 
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R(t)  - 


1 


(3.9) 


and  the  failure  rate  function  (Fig.  3.2a)  is 


A(t) 


2'  9  > 


r 


°  - 101 


(3.10) 


A  preliminary  method  of  determining  if  failure  data 
are  normally  distributed  is  to  plot  the  Median  Ranks  (equa¬ 
tion  (3.5))  versus  time  on  normal  or  arithmetic  probability 
paper.  If  the  plotted  points  closely  approximate  a  straight 
line  the  data  are  assumed  to  be  normally  distributed.  As  in 
the  exponential  case  a  goodness-of-f it  test  should  be  used 
for  verification.  If  the  data  do  not  fall  on  a  straight 
line  other  probability  paper  should  be  tried.  King  (8,  p.9) 
suggests  that  a  concave  downward  plot  on  normal  probability 
paper  usually  suggests  a  left-skewed  distribution  and  the 
next  step  after  such  a  result  would  be  to  use  extreme  value 
probability  paper.  He  further  proposes  that  a  concave  up¬ 
ward  plot  indicates  a  right  skewed  distribution  and  such  a 
result  would  indicate  the  use  of  log-normal  probability 
paper,  although  extreme  value,  chi-square,  reciprocal,  Wei- 
bull  and  log-extreme  value  probability  papers  are  also 
possibilities. 


allure  Rate  Vorsus  Time 


uoliaoility  Versus  Time 

0>> 


;  =  Area  from  zero 

,  /fA  to  time  t 

1  I  \^R(t)  =  Area  from  t  to 
i  '  j\  infinity 


Prooaoillty  b.,f.:,ity  Versus  Time 

Figure  3.2  Failure  Rate,  Reliability ,  and  Probability  Density 
Vorsus  Time  for  formally  Distributee  Peculations 


3.3  The  Weibull  Distribution 

In  1951  Waioddi  Weibull  published  a  paper  concern¬ 
ing  the  breaking  strength  of  materials  and  the  sice  distri¬ 
bution  of  fly  ash  (9)-  Because  of  the  great  flexibility  of 
the  statistical  distribution  introduced  in  this  paper  the 
Weibull  distribution  has  been  found  to  be  quite  useful  in 
other  fields  including  reliability  testing.  The  Weibull 


probability  density  function  (Fig 

.  3.3c)  is 

6 

f(t)  "if  ("T)  e 

^ ,  t  i  1 

(3.11a) 

f(t)  *=  0 

,  t  <  ^ 

(3.11b) 

where , 

3  -  Shape  parameter 

fi  >  o 

T\-  Scale  parameter 

>1  >  O 

Location  parameter 

-»<  y<  oo 

The  effect  of  varying  the  shape  parameter  is  shown  in  Fig. 
3.3  for  ^  *  0  and  =  1/2,  1,  and  P  -  3.  For  1  ess 

than  1,  the  failure  rate  decreases  with  time  and  for  $ 
greater  than  1,  the  failure  rate  increases  with  time.  For 
$  =  1,  the  density  function  becomes  that  of  the  exponential 
case  with  constant  failure  rate  For  *  3,  the  func¬ 

tion  approximates  the  shape  of  the  normal  distribution. 


The  Weibull  reliability  function  (Fig.  3.3b)  is 
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W'  ■  V  ^  v  i 
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set)  =  e‘fV^ 


’  reliability  Versus  2iao 

(b) 


X(t) 


*Ct> 


-4  W/"’0  e'^}l 


f  /’ 

V.  .  ^  I 

•y»< 
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*  £ 


Q(t)  =  Area  fror.  aero 
to  tino  t 

Vil(v)  =  Aroa  I’rora  time  t 
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- 

R(t)  =  e  71 


(3.12) 


The  fai'ure  rate  function  (Fig.  3.3a)  is 

aw  -  g 

Other  properties  of  the  Weibull  Distribution  which 
are  useful  in  life  testing  are! 

The  mean  time  to  failure, 

T  -  V  +[r(l  +  ^)]  (3.14) 

where X"*  is  the  Gamma  function. 

The  standard  deviation, 

<r  +f)  T2(1  t  j>?/2  (3-15) 

The  modal  value  of  t,  tliat  is,  the  time  at  which  the  great¬ 
est  number  of  failures  occur, 

^  1  lf% 

T  •  Y +7?(1  -  jj-)  (3.16) 

The  median  value  of  t,  that  is,  the  time  at  which  half  the 
units  in  test  will  have  failed, 

T  *  Y+7|(in  2)Ml  (3.17) 

Determining  if  failure  data  follow  the  Weibull 


distribution  is  more  complicated  than  the  procedure  for  the 
normal  or  exponential  distributions.  The  first  step  is  to 
find  the  median  rank  (equation  (3.5)). 


Lochner  provides  a  convenient  table  for  median 
ranks  which  is  good  fcr  fifty  or  fewer  items  in  test  (10). 
Note  that  if  N  and  Nf(t)  are  very  large, 

M,R.  «*100  Q(t)  (3.18) 

Once  the  median  ranks  are  determined  they  are  plotted  ver¬ 
sus  the  times  to  failure  on  Weibull  Probability  Paper.  If 
the  plotted  points  approximate  a  straight  line  it  is  con¬ 
cluded  that  the  distribution  is  Weibull  and  the  location 
parameter  If  is  equal  to  zero  (Fig.  3.4a).  It  is  far  more 
likely,  however,  that  the  plotted  points  will  approximate  a 
curved  rather  than  a  straight  line  on  the  first  trial.  If 
this  is  the  case,  f  is  not  equal  to  zero  and  it  is  necessary 
tc  plot  the  median  ranks  again;  but  this  time  versus  t  -  t^, 
where  t^  is  the  time  that  the  first  failure  was  recorded. 
Three  possible  cases  may  arise: 

Case  1.  Both  sets  of  data  curve  up  (Fig.  3.4b).  'This 

indicates  that  -«o<  <  0.  Continue  plotting  the 

median  ranks  against  t  -  until  a  value  of 
is  found  which  gives  M.R.  versus  t  -  H'\  as  a 
straight  line. 

r^se  2.  The  first  set  of  data  curves  down  and  the  second 

set  (t  -  t^)  curves  up  (Fig.  3,4c).  This  indicates 
that  0  <  Continue  plotting  the  median 

ranks  against  t  -  for  positive  values  of 


between  0  and  t^,  until  a  value  of  if  ^  is  found 
which  gives  M.R,  versus  t  •  ^  as  a  straight  line. 

Caja  3.  Both  sets  of  data  curve  down  (Fig.  3.4d).  The 
failure  data  being  analysed  do  not  follow  the 
Weibull  distribution  and  other  probability  dis¬ 
tributions  should  be  considered* 

If  Case  1  or  Case  2  prevails,  the  data  fit  the  Weibull  dis¬ 
tribution  and  #  is  equal  to  the  trial  ^  which  produces  the 
best  straight  line  approximation. 

The  shape  parameter,  &  ,  is  actually  a  measure  of 
the  slops  of  the  M.R.  versus  t  -  line.  Most  Weibull 
papers  have  an  origin  through  which  a  line  may  be  drawn 
parallel  to  the  M.R.  versus  t  -  If  line.  This  parallel  line 
will  intersect  the  scale  on  the  paper  and  the  value  of  $ 
can  be  read  directly.  To  find  the  value  of  the  scale  para¬ 
meter,  ,  first  find  the  intersection  of  the  M.R.  versus 
t  -  Y  line  with  the  horizontal  line  at  M.R.  =  63.2%.  From 
this  point  of  intersection  drop  a  vertical  ’ine.  Read  the 
value  of  t  -  If  where  the  vertical  line  crosses  the  abcissa. 
This  value  of  t  -  Y  is  7f. 

Since  the  graphical  method  gives  no  measure  of 
the  accuracy  of  the  fit  of  the  distribution  to  the  data, 
it  is  prudent  to  verify  the  graphical  procedure  by  the  use 
of  a  goodness-of-f it  test. 


3.4  Test  for  Goodness-of-f it 


where  i  *  1  for  the  first  failure,  2  for  the  second  failure 
and  finally,  i  =  N  for  the  Nth  failure. 

The  maximum  value  of  is  compared  with  values 
from  the  Kolmogorov-Smimov  Table  of  significance  levels 
(Table  3.1).  If  the  value  of  the  maximum  is  less  than 
the  value  given  by  the  Kolmogorov- Smirnov  Table  for  the 
sample  size  N  and  at  a  specified  significance  level,  we  can 
accept  the  distribution  in  question  at  the  specified  level 
of  significance.  If,  '/n  the  other  hand,  the  maximum  is 


greater  than  the  value  in  the  table,  the  distribution  is 
reiected  at  the  specified  significance  level. 

for  example,  Lt  is  desired  to  know  if  the  exponen¬ 
tial  distribution  may  be  used  at  the  .05  level  of  signi¬ 
ficance  to  represent  the  failure  data  of  10  items  in  life 
test.  The  D^'s  are  calculated  at  the  10  times  to  failure. 
The  maximum  absolute  difference  is  found  to  be  *  .45732, 
Entering  Table  3.1  with  N  of  10  and  a  significance  level 
of  .05,  the  maximum  allowable  value  of  D  at  the  .05  level 
of  significance  is  read  as  .410.  Since  is  higher  than 
the  allowable  value  the  exponential  distribution  is  rejected 
with  a  level  of  significance  of  .05.  Had  been  less  than 
.410  the  exponential  distribution  would  have  been  accepted 
with  a  significance  level  of  .05.  Note  that  the  values  of 
maximum  allowable  D  become  more  stringent  with  increasing 
values  of  the  level  of  significance.  For  most  purposes  and 
for  this  paper  a  level  of  significance  of  .05  is  considered 
sufficient  to  accept  a  trial  distribution. 


Table  3.1  D  values  for  the  Kolmogorov- Smirnov  Coodness-of- 
fit  Test  at  Various  Significance  Levels  and 
Sample  Sizes  (11) 


Sample 

Size 


(N) 


Level  of  Significance 


0.20  0.15  0.10  0.05  0.01 


1 

0.900 

2 

0.684 

3 

0.565 

4 

0.494 

c 

0.446 

6 

0.410 

7 

0.381 

S 

0.358 

9 

0.339 

1C 

0.322 

11 

0.307 

]  2 

0.295 

13 

0.284 

14 

0.274 

15 

0.266 

16 

0.258 

17 

0.250 

18 

0.244 

19 

0.237 

20 

0.231 

25 

0.21 

30 

0.19 

35 

0.18 

0.925 

0.950 

0.726 

0.776 

0.597 

0.642 

0.525 

0,564 

0.474 

0.510 

0.436 

0.470 

0.405 

0.438 

0.381 

0.411 

0.360 

0.388 

0.342 

0.368 

0.326 

0.352 

0.313 

0.338 

0.302 

0.325 

0.292 

0.314 

0.283 

0.304 

0.274 

0.295 

0.266 

0.286 

0.259 

0.278 

0.252 

0.272 

0.246 

0.264 

0.22 

0.24 

0.20 

0.22 

0.19 

0.21 

0.975 

0.995 

0.842 

0.929 

0.708 

0.828 

0.624 

0.733 

0.565 

0.669 

0.521 

0.618 

0.486 

0.577 

0.457 

0.543 

0.432 

0.514 

0.410 

0.490 

0.391 

0.468 

0.375 

0.450 

0.361 

0.433 

0.349 

0.418 

0.338 

0.404 

0.328 

0.392 

0.318 

0.381 

0.309 

0.371 

0.301 

0.363 

0.294 

0.356 

0.27 

0.32 

0.24 

0.29 

0.23 

0.27 

Over 

35 


1.07 

r 


1.14 


1.22 


1.36 


1.63 
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3 . j  Symbols  Introduced  in  Chapter  3 


D 


e 


i 

j 


M.R. 


Qexp(ti> 


WV 

r 


T 

T 

r 

ar 

71 

cr 


Absolute  difference  between  expected  and 
observed  unreliabilities 
Base  of  natural  logarithms,  e  =  2.71828.... 
Subscript  indicates  1,2,3,...  in  turn 
Subscript  indicates  1,2,3,...  in  turn 
Median  Rank 

Expected  unreliability  at  time  t^  based  on 
a  known  distribution 

Observed  value  of  unreliability  at  time  t^ 
Median  time  (half  of  units  in  test  have 
failed) 

Modal  time  (greatest  number  of  failures  occur) 
Mean  time  (arithmetic  average  of  failure 
times) 

Gamma  function 
Weibuli  shape  parameter 
Weibull  location  parameter 
Weibuli  scale  parameter 
Standard  deviation 


CHAPTER  4 


THE  FAILURE  RATE  CURVE 


4. I  Introduction 

In  the  early  1950's,  after  plotting  failure  rate 
data  over  a  period  of  years  for  various  electrical  and 
complex  mechanical  devices,  it  became  evident  to  researchers 
that  the  failure  rate  curves  of  many  of  these  unrelated 
items  had  certain  characteristics  in  common.  It  was  ob¬ 
served,  for  example,  that  in  the  early  portion  of  component 
life  there  was  an  initially  high  failure  rate  which  de¬ 
creased  with  increasing  component  age.  During  the  long 
middle  portion  of  component  life  the  failure  rates  were  ob¬ 
served  to  level  off  and  become  relatively  constant.  Fail¬ 
ures  during  this  period  occurred  at  random  intervals  not 
related  to  component  age.  Finally,  an  increasing  failure 
rate  was  noted  as  the  components  became  worn.  An  idealized 
failure  rate  curve  of  the  type  described  above  is  shown  in 
Fig.  4.1a.  The  shape  of  the.  curve  has  earned  it  the  name, 
"bathtub  curve."  The  failure  rate  curve  is  of  particular 
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interest  in  this  study  because  the  three  failure  regions, 
early,  chance,  and  wearout  are  more  distinct  on  this  curve 
than  on  either  the  R(t)  or  f(t)  curves  (Fig.  4.1b,c). 

The  necessity  for  separating  the  failure  regions  is 
apparent  when  it  is  considered  that  each  failure  mode  fol¬ 
lows  a  distinct  statistical  distribution,  and,  therefore, 
requires  individual  mathematical  treatment.  A  second  reason 
for  separating  the  failure  regions  is  that  each  failure 
mode  requires  a  different  physical  technique  to  improve  re¬ 
liability. 

In  the  next  three  sections  the  principle  mechanisms 
accounting  for  early,  chance,  and  wearout  failures  will  be 
considered. 

4 . 2  Eiarly  Failures 

A  high  incidence  of  early  failures  in  a  component 
population  is  an  indication  of  poor  quality  control  and 
improper  debugging  and  burn-in  procedures.  Substandard  com¬ 
ponents  are  initially  weaker  than  the  good  components  in  a 
mixed  population  and  therefore  they  fail  at  much  lower 
levels  of  stress  (Fig.  4.2a).  As  the  higher  failure  rate, 
substandard  items  fail  and  are  removed  from  the  population, 
the  population  failure  rate  decreases  (12,  p.  4). 

The  Weibull  distribution  with  $<1  has  been  used 
to  represent  the  early  failure  period.  Mendenhall  and 


34 


Hader  have  also  shown  that  a  combination  of  two  exponential 
distributions  with  different  values  of  ^  for  the  early  and 
chance  failure  periods  will  give  the  characteristic  de¬ 
creasing  failure  rate  of  the  early  period  (13). 

4. 3  Chance  Failures 

Chance  failures  occur  randomly  and  are  independent 
of  component  age;  that  is,  they  occur  during  the  entire 
period  that  a  component  is  in  service  and  not  just  during 
the  chance  period.  In  the  early  period,  chance  failures 
occur  together  with  early  or  substandard  failures,  and  in 
the  wearout  phase,  chance  failures  occur  with  the  wearout 
failures.  During  the  chance  failure  period  it  is  assumed 
that  only  chance  failures  occur.  All  the  substandard  items 
have  already  failed  and  wearout  has  not  yet  begun.  Chance 
failures  are  caused  by  sudden,  unpredictable  stress  accumu¬ 
lations  (Fig.  4.2b). 

The  exponential  distribution  is  most  frequently 
used  to  model  the  theoretically  constant  failure  rate  which 
characterizes  the  chance  period  of  component  life.  A  more 
flexible  model  is  the  Weibull  distribution  with  $  approxi¬ 
mately  equal  to  1.  This  distribution  does  not  require  an 
absolutely  constant  failure  rate  in  order  to  have  a  good 
fit  to  observed  failure  data. 
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4.4  Wearout  Failures 


After  the  chance  period  of  life,  component  deteri¬ 
oration  begins  and  failures  become  a  function  of  age.  As 
the  surviving  components  become  older  they  also  grow  weaker 
and  more  subject  to  failure  at  lower  levels  of  stress. 

Thus,  deterioration  produces  the  characteristic  increasing 
failure  rate  of  the  wearout  period  (Fig.  4.2c). 

Both  the  normal  distribution  and  the  Weibull 
tribution  with  £  greater  than  1  have  been  successful lyms ed 
to  model  the  increasing  failure  rate  of  the  wearout  p$&od. 

4.5  Summary 

In  this  chapter  it  has  been  shown  that  the  indivi¬ 
dual  failure  events  can  be  modeled  by  known  statistical 
distributions.  It  remains  to  be  shown,  however,  that  the 
entire  bathtub  curve  including  early,  chance,  and  wearout 
failures  can  be  modeled  by  a  unified  mathematical  expression 
derived  from  accepted  reliability  theory.  The  remaining 
chapters  are  oted  to  finding  3uch  an  expression. 


CHAPTER  5 

THE  BASIS  FOR  A  UNIFIED  EXPRESSION 

5.1  Criteria  for  a  Good  Model 

Curve  fitting  techniques  could  be  used  to  fit  a 
Jj^.thematical  expression  to  observed  failure  data,  but 
tl^Ls  would  provide  very  little  insight  into  the  mechanics 
of  failure  and  would  not  be  of  great  use  in  reliability  im¬ 
provement.  A  mathematical  model  based  on  the  theory  of 
failure  would,  on  the  other  hand,  provide  relevant  informa¬ 
tion  about  failure  modes  which  would  be  quite  useful  in 
analysing  failure  data  and  for  improving  product  reliability. 
For  the  purpose  of  this  study  a  good  model  of  the  combined 
failure  rate  curve  is  defined  as  one  which  meets  the  follow¬ 
ing  criteria r 

a.  The  unified  model  must  combine  the  phenomena 
of  earily,  chance,  and  wearout  failures. 

b.  The  number  '■‘f  restrictive  assumptions  necessary 
to  derive  the  model  should  be  minimal. 

c.  The  model  should  be  mathematically  simple  if  at 
all  possible. 
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d.  The  model  should  correspond  to  observed  failure 
data  at  the  .05  significance  I  eve  J  lor  Iht* 
Kolmogorov- Smirnov  Goodness-of-fit  Test. 

e.  The  model  should  be  useful  for  prediction  and 
for  theoretical  speculation,, 

f.  The  theoretical  basis  for  the  model  must  be 
consistent  with  the  manner  in  which  the  data 
are  generated. 

5.2  The  Product  Rule 

If  it  is  assumed  that  there  is  a  single  population 
of  identical  components  and  that  the  events  of  early  failure, 
chance  failure,  and  wearout  failure  are  independent,  that 
is  to  say,  th&t  the  occurrence  of  one  type  of  failure  in 
no  way  alters  the  probability  of  occurrence  of  the  other 
types  of  failure,  then  the  unified  unre liability  would  be 
as  shown  in  the  Venn  diagram  (Fig.  5.1).  For  independent 
events  the  joint  probability  of  several  events  occurring 
is  simply  the  product  of  the  individual  probabilities.  The 
enclosed  area  of  the  Venn  diagram  is 

Qecw(t)  ■  %(t)  +  %(t)  +  '  Qe(t)  Qc(t) 

-  Qe(t)  Q^t)  -  Qc(t)  <^(0  (5.1) 

+  Qe(”-)  Qc(t)  qyr.) 


?i£UT6  5.1  Venn  Diagram  for  S&rly,  Oh&aoo,  and  tfoarout 
failures  Oonsiderod  as  Independent  Events 
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where 
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If  each  Q  term  is  replaced  by  its  complement,  1  -  F,  the 
reliability  product  rule  is  obtained. 


where 
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(5.2) 


Equation  (5.2)  is  a  potential  model  of  the  combined  relia¬ 
bility  curve  and  it  does  meet  certain  of  the  specified 
criteria  set  forth  in  Section  5.1.  For  example,  the  product 
rule  does  combine  the  effects  of  the  three  failure  phenom¬ 
ena.  There  is  only  one  limiting  assumption:  independent 
failure  events.  And  the  product  rule  is  a  very  simple 
expression.  The  crucial  question  is  whether  or  net  the 
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product  rule  corresponds  to  the  observed  failure  data.  To 
answer  this  question,  consider  equation  (5.2)  with  the  In¬ 
dividual  reliabilities  replaced  by  their  respective  defini¬ 
tions 


R  (t) 
ecw 


N  (t)  N  (t) 


eas 


N 


N 


N  (t) 
LlT 
N 

w 


(5.3) 


At  the  end  of  the  early  period  it  is  assumed  that  all  early 
failures  have  occurred,  consequently,  N  (t,  )  -  0.  The 

0  f  S  D 

result  of  this  is  that  for  all  t  greater  than  t^»  the  com¬ 
bined  reliability  given  by  equations  (5.2)  and  (5.3)  is 
zero.  This  result  is  in  direct  conflict  with  physical 
observations  of  mixed  samples  which  indicate  that  the 
reliability  at  t  «  t^  is  normally  quite  high.  For  this 
reason  the  product  rule  which  includes  early  failures  as 
independent  failure  events  in  a  single  population  is  reject¬ 
ed  as  a  possible  model  of  the  combined  failure  rate  curve. 
The  possibility  of  using  the  product  of  reliabilities  for 
chance  and  wearout  failures  alone  will  be  considered  in 
Section  5.4. 

An  interesting  sidelight  to  the  product  rule  is  the 
convenient  but  incorrect  expression 


\  +v 

*  e  c 
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which  has  been  used,  on  occasion,  by  reliability  engineers 
to  represent  the  combined  early  and  chance  failure  rate  dur¬ 
ing  early  life.  This  simple  expression  might  appear  valid 
after  a  casual  inspection  of  the  failure  rate  curve  (Fig. 
4.1a).  However,  it  is  derived  by  the  combination  of  the 
reasonable  assumption  that  both  early  and  chance  failures 
are  of  the  exponential  type  with  the  incorrect  assumption 
that  reliability  in  early  life  is  represented  by  the  pro¬ 
duct  of  early  and  chance  reliabilities.  Mathematically, 


R  (t)  -  R  (t)  R  (t) 
ec  e  c 


or 


Taking  the  natural  log 

+\ 

which  is  incorrect  since  it  was  shown  previously  that 
Rec(t)  t  R.(t)  Rc(t) 

and  therefore. 
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5.3  The  Summation  Rule 

Consider  a  hypothetical  non-replacement,  life  test 
of  N  components.  The  components  are  placed  in  test  at 
time  t  *  0  without  prior  burn-in  or  debugging.  As  each 
component  fails,  the  time  to  failure  is  recorded  and  the 
test  is  terminated  when  the  last  component  in  test  fails. 

To  assist  in  visualizing  the  problem,  assume  that  the  com¬ 
ponents  are  of  a  type  that  can  be  examined  after  failure 
and  the  mode  of  failure  determined,  be  it  early,  chance, 
or  wearout.  As  items  fail  during  the  test  they  are  removed 
from  the  test  population,  inspected,  and  segregated  into 
three  lots  according  to  failure  type.  At  the  conclusion 
of  the  test  there  will  be  three  subpopulations  of  N  ,  N^, 
and  N  failed  items,  respectively  where  N  +  N  +  N  =  N. 

If  the  events  of  the  tests  are  now  reconstructed,  it  may  be 
theorized  that  at  time  t  -  0  there  were  actually  three 
separate  subpopulations  in  test  each  with  an  individual 
failure  density  distribution,  even  though  at  time  t  »  0 
no  failures  had  yet  occurred  and  therefore  the  subpopula¬ 
tions  were  not  physically  distinguishable.  It  is  not  perti¬ 
nent  in  making  this  assumption  that  the  items  actually  be 
inspected  to  determine  N  ,  N  ,  and  N  ,  as  the  subpopulations 
would  exist  ever,  if  these  values  were  unknown  or  were 
physically  indeterminable.  A  graphical  method  for  finding 
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N  ,  N  ,  and  N  from  observed  failure  data  is  suggested  in 

6  G  W 

Chapter  6. 

If  the  preposition  that  there  are  three  failure  sub¬ 
populations  is  accepted,  a  mathematical  expression  combining 
the  effects  of  the  subpopulations  may  be  found  directly. 

The  cumulative  number  of  items  failing  in  the  mixed  popula¬ 
tion  is  found  from  the  definition  of  unreliability 

N  .(t)  -  N  Q  (t)  (5.4) 

ecw,f  xcw 


Similarly,  the  number  failing  from  each  of  the  subpopula¬ 
tions  is 


Ne,f(t>  ■  Nc  (5-5) 

-  Nv  <*w(t> 


The  total  number  of  failures  by  time  t  in  the  mixed  popu¬ 
lation  is  simply  the  sum  of  the  failures  in  the  three  sub¬ 
populations. 


N.cw,f(t)  ■  Ne,f(t>  +  Nc,f(t)  +  Nw,f(t)  (5-6> 
Substituting  equations  (5.4)  and  (5.5)  into  equation  (5.6) 
yields 


N  «ecv(t)  '  NA(t)  +  NcV°  +  NA(t)  (5’7) 
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or 


Q  (t) 

xecw' 


N  N  N 

r  «e(t>  +  r  <Jc(t> +  ?r 


(5.8) 


This  expression  could  also  have  been  derived  directly  from 
Baves’  Theorem  (14,  p.  57),  which,  for  this  example,  says 
that  the  unreliability  of  a  component  drawn  at  random  from 
a  mixed  population  composed  of  three  failure  subpopulations 
is  the  summation  of  three  terms:  the  probability  that  the 
component  is  from  the  early  subpopulation  times  its  unre¬ 
liability  if  it  is  from  the  early  subpopulation,  plus  the 
probability  that  the  component  is  from  the  chance  subpopu¬ 
lation,  times  its  unreliability  if  it  is  from  the  chance 
subpopulation,  plus  the  probability  that  the  component  is 
from  the  wearout  subpopulation  times  its  unreliability  if 
it  is  from  the  wearout  subpopulation.  This  may  be  written 
mathematically  as 


«W°  ‘  !r  +  !r  V‘>  +  r  Ve> 

* 

which  is  identical  to  equation  (5.8). 

If  in  equation  (5.8)  each  of  the  unreliability  terms 
is  replaced  by  its  complement,  Q  ■  1  -  R,  a  unified  relia¬ 
bility  expression  is  obtained  as  follows: 


i  -  go  -  J.  [i-Re<t)]  ♦  ^  [i-Rc<t>]  ♦  /  fi-sw(t)] 

N  N  N  N  N  N 

1  ‘  Recv^  “  tT  “  W  Re^  +  "  tT  Rc^  *  N  N*  Rw^ 
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s  Lnce 


N  +  N  +  N  -  N 
e  c  w 


or 


N 


N 


N 


—  +  —  +  —  -  1 
NNN 


it  follows  that 


1  -  Recw(t)  - 1  -  r  ».<»>  -  r  Rc(t>  -  r  V«> 


This  reduces  to  the  unified  reliability  function 

N  N  N 

JC 

N 


Recw^  *  N  Re^  +  Rc^  +  N  Rw^ 


(5.9) 


which  is  also  called  the  reliability  aunsnation  rule. 

The  other  functions  of  interest  are  found  by  ap¬ 
plying  the  equations  of  Chapter  2  to  equation  (5.9). 

From  equation  (2.12)  it  is  known  that 


f 

ecw 


(t) 


(5.10) 


For  the  summation  model  this  becomes 
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ecw 


(t) 


d_ 

dt 


which  ^educes  to 

W‘>  -  +  ^  £c<°  +  Js  V‘>  (5-u> 


This  equation  is  else  gJven  by  K.  L.  Wong  (4,  p.19)  as 
representing  the  combined  probability  density  function  when 
eary,  chance,  and  vearout  failures  are  present.  Applying 


equation  (2.11)  to  equation  (5.9)  it  is  shown  that 
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or, 


^cw(t)  ' 
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N  “eN~'  ’  N  *V~'  '  N  w' 

The  mission  reliability  for  a  mission  of  duration  T  start¬ 
ing  at  time  t  with  a  surviving  component  is  found  by  use  of 
equation  (2.20) 

R_„_(t  +  T)  (5.14; 


R  (t,T) 
ecw 


ecw 

R  Tt! 


ecw 


which  expands  to 

N  N  N 

rp  R  <t  +  T)  +  R  (t  +  T)  +  R  (t  +  T) 
«ecw(t.T)  -  S-8 - - — - 

r  Re(t)  +  FT  «c<*>  +  FT  *w(t> 

(5.15: 

A  cursory  check  of  the  criteria  for  a  good  model  given  in 
Section  5.1  shows  that  the  summation  rule  does  meet  certain 
of  the  specified  criter**.  Tha  Phenomena  of  early,  chance, 
and  wearout  failures  ere  combined.  The  single  assumption 
required  for  this  model  is  that  the  mixed  population  be 
composed  of  three  individual  failure  subpopulations.  The 
model  is  mathematically  simple  and  if  the  above  assump¬ 
tion  is  correct,  this  model  would  be  useful  for  prediction 
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and  theoretical  speculation.  The  correspondence  of  the 
model  to  observed  failure  data  cannot  be  shown  as  simply 
as  the  other  criteria  considered  here  and,  therefore, 
this  will  be  considered  in  detail  as  a  separate  subject 
in  Chapters  7,  8,  and  9.  Conclusions  concerning  the 
validity  of  the  summation  rule  as  a  model  for  component 
reliflbiJ-'ty  are  hel  l  in  abeyance  until  further  examination 
is  provided. 

5.4  Combined  Product  and  Summation  Rules 

It  was  shown  in  Section  5.2  that  early  failures 
could  not  be  combined  with  chance  or  wearout  failures  by 
use  of  the  product  rule  because  the  unified  reliability 
would  go  to  zero  at  the  end  of  the  early  life  period.  In 
Section  5.3  it  was  shown  that  this  problem  did  not  occur 
when  the  summation  rule  was  used  as  the  unified  reliability 
model.  A  third  possible  combination  of  reliabilities 
exists  and  there  is  ample  theoretical  basis  for  its  con¬ 
sideration  as  will  be  shown. 

In  Section  5.3  it  was  theorized  that  wearout  fail¬ 
ures  and  chance  failures  could  be  considered  as  separate 
failure  subpopulations.  A  re-examination  of  Fig.  4.2 
would  cast  some  doubt  on  this.  Figure  4,2a  clearly  shows 
two  subpopulations,  the  substandard  and  the  good.  The 
substandard  subpopulation  c<  ^tributes  the  early  failures, 
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whereas,  the  good  subpopulation  contributes  both  the  chance 
and  the  wearout  failures  (Fig.  4.2b,c).  It  was  noted  in 
Section  4.3  that  in  the  early  period  both  early  (substandard) 
and  chance  failures  occur.  This  means  that  in  the  early 
period  two  subpopulations  are  in  test  and  the  summation 
rule  may  be  applied.  In  the  chance  and  wearout  periods, 
however,  only  t‘>>  good  subpopulation  is  in  test,  the  sub¬ 
standard  items  having,  already  failed.  Tne  good  subpopula¬ 
tion  provides  both  the  chance  and  wearout  failures.  For  a 
good  component  to  survive  during  the  chance  and  wearout 
periods  it  must  survive  both  types  of  hazard,  chance  fail¬ 
ure  and  wearout  (deterioration)  failures.  Tiis  is  a  serial 
arrangement  of  reliabilities  to  which  the  oroduct  rule  must 
be  applied. 

The  unified  reliability  of  a  component  drawn  at 
random  from  a  population  containing  good  and  substandard 
components  is  found  directly  from  Bayes'  Theorem  to  be  the 
sum  of  two  terms:  the  probability  that  the  component  is 
substandard  times  its  reliability  if  it  is  substandard, 
plus  the  probability  that  the  component  is  good  times  its 
reliability  if  it  is  good  (14,  p.  57), 

Mathemat ical ly , 

N  N  +  N 

R  (t)  =  ~R  (t)  +  „  W  R  (t)  R  (t)  (5.16) 

ecw  Ne  N  e  w  '  ' 
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where  N  +  N  is  equal  to  the  number  of  good  components  in 
c  w 

test,  Ng.  The  other  functions  of  interest  are  found  by 
applying  the  general  equations  developed  in  Chapter  2  and 
in  Section  5.3  of  this  chapter  to  equation  (5.16).  From 
equation  (5.10) 

£ecw(t)  '  '  ft  [Recw(t)] 


For  the  combined  model  this  is 


Wfc)  ■  r  {-  ft  V*>}  +  ft  [Rc(t)  V£>l} 


which  may  be  written 


Wfc>  -  r  {-  ft  Re(t>)  +^NJ{Vt>  C-ftV^] 

[-  ft  Rc(t)l  1 

Therefore,  the  combined  probability  density  function  is 


N  N  +  N 

f  (t)  =  rp  f  (t)  +  — — ■ 
ecw  N  e  N 


From  equation  (5.12) 
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which  gives  the  combined  failure  rate 

,  (t)  _  rfe<t>  +  -84^tL,c<t)Vt)  +  Rw(t)fc(t)] 


N  N  +  N 
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(5.18) 


The  mission  reliability  for  a  mission  of  duration  T  start¬ 
ing  at  time  t  is  found  from  equation  (5.14)  to  be 


N  N  +  N 

r  .1  (t  *  T)  +  C  *  R  (t  +  T)R  (t  +  T) 

R  (t,T)  -  - 5— J ^ - — S- - 

ecw  W  N  +  N 

r  Re(t)  +  -hH1  Vc>  R„<£> 


(5.19) 


A  check  of  the  criteria  for  a  good  model  given  in  Section 
5.1  shows  that  the  combined  product  and  summation  rule  does 
meet  certain  of  the  specified  criteria.  The  phenomena  of 
early,  chance,  and  weaiout  failures  are  combined.  The  sin¬ 
gle  restricting  assumption  is  that  there  are  two  failure 
subpopulations,  one  exhibiting  only  early  failures  and  the 
other  exhibiting  both  chance  and  wearout  failures.  The 
model  is  mathematically  simple  and  if  the  assumption  above 
is  correct  the  model  would  be  useful  for  prediction  and 
theoretical  speculation.  Of  the  three  potential  models 
considered  in  this  section,  two  still  remain  and  require 


further  analysis. 


5.5  Comparison  of  Different  Models 

Thus  far  in  this  chapter  three  potential  unified 
reliability  models  have  been  considered:  the  reliability 
product  model,  the  reliability  summation  model,  and  a 
combined  product  and  summation  model.  The  product  model 
was  eliminated  because  it  went  to  zero  at  the  end  of  the 
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early  period.  Therefore  it  is  prudent,  before  going  to 

Chapter  6,  to  compare  the  two  remaining  models  at  several 

critical  time  periods  during  component  life  to  insure 

that  the  models  are  valid  during  the  entire  lifetime  and 

to  provide  a  comparison  of  the  two  equations  during  the 

different  periods.  The  equations  which  will  be  compared 

are  equation  (5.9),  the  reliability  summation  rule,  and 

equation  (5.16),  the  combined  product  and  summation  rule. 

At  time  t  ■  0  no  components  have  failed,  and, 

therefore,  R  *  R  «  R  =  1. 

'  e  c  w 

SUMMATION  RULE  (t  =  0) 
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ecw  N  N  N 

or 


R  (0) 
ecw 


N  +  N  +  N 
e  c  _ w 

N 


Since  N  +  N  +  N  -  N, 
ecw’ 


N 


R  (0)  -  g  -  1 
ecw  N 

COMBINED  RULE  (t  -  0) 

N  N  +  N 

R  (0)  -  —  (1)  +  ~  (1) 

ecw  N  N 


or, 


D  N+N+N_tt 

R  (0)  =  e  c  w  ■  N  _  , 
ecw  - jj - n  -  1 


(5.20) 


(5.21) 


Hence,  both  rules  give  the  desired  result  for  t  ■  0. 
Next  consider  the  period  0  t  t^.  The  individual 


53 


reliabilities  are  8  *  R  (t),  R  »  R  (t),  and  R 

e  ev  "  c  cv  '*  w 

wearout  does  not  begin  prior  to  t . 


*  1  since 


SUMMATION  RULE  (0  £  t  £  t  ) 

K  k  7 

Recwvt^  *  ~  Re^fc)  +  ~  Rc(t)  + 


(5.22) 


COMBINED  RULE  (0  <  t  £  t  ) 


R  (t)  -  rj£  R  (t)  + 
ecw  N  e 


N  +  N 

-Sr*  V‘> 


(5.23) 


Equations  (5.22)  and  (5.23)  vary  by  the  amount, 

Nw  r  -i 

jj—  JJ.  -  R  (t)  I  *  in  tests  in  which  wearout  is  not  con¬ 
sidered,  N  =0  and  both  equations  (5.22)  and  (5.23) 
w 

reduce  to 


N  N 

R  (t)  «  r  R  (O  +  r  R  (t) 

ecw  '  N  e'  N  c 


(5.24) 


Equation  (5.24)  is  given  by  Dietrich  (15,  p.  15),  Wong 
(4,  p.  19),  Polovko  (16,  p.  94),  and  Mendenhall  and  Hader 
(13,  p.  505). 

In  the  time  period  t^  ^  t<  0©  the  individual  reli¬ 
abilities  are  R  =  0,  R  =  R  (t),  and  R  =  R  (t). 

e  ’  c  c  w  w 


SUMMATION  RULE  (t.  4  t <•©) 

b  — 


N  N 

R  (t)  -  r£  R  (t)  +  -S  R  (t) 
ecw  N  c  N  w 


(5.25) 


COMBINKD  RULE  (tb  $  l <  ••  ) 
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N  +  N 

R  (c)  .  ----- v;  R  (t)  R  (t)  (5.26) 

ecw  N  c  w 

Although  equations  (5.25)  and  (5.26)  are  dissimilar,  forms 

of  both  have  been  referenced  in  the  literature  for  the  time 

period  after  burn-in.  Equation  (5.25)  is  given  by  Polovko 

(16,  p,  95)  and  by  Wong  (4,  p,  19).  Equation  (5.26)  is 

N  +  N 

given  when  N  “0  and  therefore  c  w  -  1  by  Bazovsky 
6  N 

(5,  p.  52)  and  by  Pieruschka  (17,  p.  73), 

The  two  models  have  now  been  considered  and  com¬ 
pared  for  three  critical  periods  of  time  during  component 
life  and  neither  has  been  eliminated.  To  be  of  use,  how¬ 
ever,  the  parameters  of  the  models  must  be  obtainable.  In 
Chapter  6  a  method  will  be  given  for  obtaining  the  para¬ 
meters  of  the  summation  rule.  The  combined  model  offers 
a  different  problem  in  parameter  determination  and  although 
considerable  effort  was  expended  to  find  an  analytical 
method  for  the  determination  of  R  (t)  and  R  (t),  no  satis- 

G  W 

factory  method  was  developed.  The  only  practical  technique 

found  for  applying  equati  n  (5.16)  without  prior  knowledge 

of  R  (t)  and  R  (t)  :  to  eliminate  one  of  the  parameters 

by  combining  tV  quantity  R  (t)  *  R  (t)  into  a  single  term 

c  w 

R  (t).  Th  result  of  this  combination  is  to  reduce  the 

VJ 
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combined  summation-product  model  to  an  ordinary  summation 
model  wi  :h  two  subpopulations 

Recw^  "  fT  Re!  ^  +  fT  RG^  (5.27) 

where  N  +  ■  N.  An  expression  of  this  form  is  used 

by  Kao  (18,  p.  397)  to  describe  electron  tube  failures. 

If  the  life  test  is  truncated  before  wearout.  equation 
(5.27)  is  equivalent  to  equation  (5.24).  One  method  of 
parameter  determination  for  equation  (5.27)  is  given  by 
Kao  (18)  and  another  is  given  in  Chapter  6  of  this  report. 
The  final  choice  as  to  which  model  to  use,  equation  (5.9) 
or  equation  (5.27),  must  be  based  on  the  data  itself.  A 
graphical  aid  to  assist  in  determining  the  number  of  under¬ 
lying  subpopulations  is  suggested  in  Chapter  6.  In  cases 
in  which  both  equation  (5.9)  and  equation  (5.27)  appear  to 
apply,  equaticn  (5.9)  is  preferred  because  with  its  addi¬ 
tional  parameters  it  is  the  more  powerful  expression. 
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5 . 6  Symbols  Introduced  in  Chapter  5 


£e(t) 

f  (O 


£  (t) 
w 

f  (t) 
ecw 

N 

N 

O 

N 

c 

N 

w 

N_ 


Nfc,£<t> 

Nw,f(t) 

N  ,(t) 
ecw,fv 

N  (t) 
e,s 


Nc,s(t) 


N  (t) 
w,s 


Qe(t) 

Qr(t) 

Qecw(t^ 


Early  probability  density  function 

Chance  probability  density  function 

Wearout  probability  density  function 

'Jnified  probability  density  function 

Total  items  in  test,  N  ■  N  +  N  A  N 

ecw 

Total  items  failing  in  early  mode  (substandard) 
Total  items  failing  in  chance  mode 
Total  items  failing  in  wearout  mode 
Total  good  items  in  test,  N_  -  1  -  N  »  N  +N 
Items  failing  in  early  mode  by  time  t 
Items  failing  in  chance  mode  by  time  t 
Items  failing  in  wearout  mode  by  time  t 
Items  failing  in  all  modes  combined  by  time  t 
Items  in  early  subpopulation  surviving  by 
time  t 

Items  in  chance  subpopulation  surviving  by 
time  t 

Items  in  wearout  subpopulation  surviving  by 
time  t 

Early  unreliability  N  f(t)/N 
Chance  unreliability  N  f(t)/N 

C|  l  C- 

Wearout  unreliability  N  f(t)/N 

wf  I  W 

Unified  unre liability  function 
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R  (t)  Early  (substandard)  reliability  N  (t)/N 

6  S  f  3  6 

R  (t)  Chance  reliability  N  (t)/N 

C  C  f  s  c 

R  (t)  Vearout  reliability  N  (t)/N 

W  W  «i  s  w 

RG(t)  Reliability  of  all  good  components  regard¬ 
less  of  failure  mechanism 

RecW(t)  Unified  reliability  function 

t^  Time  at  end  of  the  early  period 

t  Time  at  beginriing  of  wearout  period 

(t)  Unified  failure  rate  function 

’ ’ V/ 


CHAPTER  6 


TECHNIQUE  FOR  APPLYING  THE  RELIABILITY  SUMMATION  MODEL 


6 . 1  Introduction 

If  the  early,  chance,  and  v^earout  failure  periods 
are  each  modeled  by  an  individual  Weibull  Distribution, 
equation  (5.9)  may  be  written  as 

t-Jf  fie  t-)f  fic  t-lf  fiw 

N  -(-sr^)  N  -(-zr^>  N  -l-r) 

(t)=_£  +_£eV7^.'  +_£  e  ’f*' 

'  N  Ne  N 

(6.1) 


ecw 


where  the  subscripts  e,c,  and  w  indicate  individual  Weibull 
parameters  for  the  early,  chance,  and  wearout  subpopula¬ 
tions,  respectively.  In  order  to  use  equation  (6.1)  to 
compute  component  reliability  it  is  necessary  to  find  the 
three  subpopulation  sices  N  ,  N  ,  and  N  and  the  nine 

6  G  W 

Weibull  parameters,  $p,  ? f&,  ,  $  ,  9fc ,  Tw»  fiw>  and7(w- 


The  subpopulation  sizes  may  be  determined  by 
analytical  means  or  by  physical  failure  analysis,  but  the 
analytical  method  is  preferred  for  several  reasons: 

1.  Analytical  methods  are  usually  less  expensive 
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and  are  less  time  consuming  than  physical 
inspection. 

2.  The  components  may  not  lenu  themselves  to 
physicai  inspection  without  sophisticated  and 
expensive  test  equipment,  and  the  expenditure 
of  much  time. 

3.  The  failure  data  may  be  old  and  the  components 
no  longer  available  for  inspection. 

4.  The  components  may  be  destroyed  upon  failure, 
thus  making  it  impossible  to  conduct  a  failure 
analysis. 

5.  The  components  may  be  in  remote  or  otherwise 
inacfviS^.  'le  equipment. 

Kao  (18)  suggests  a  graphical  method  for  separating 
the  subpopulations  and  finding  five  Weibull  parameters  for 
e  mixei  population  consisting  of  two  subpopulations.  For 
the  purpose  of  this  report  an  attempt  was  made  to  extend 
Kao’s  original  method  so  that  it  would  apply  to  the  case  of 
three  subpopulations,  but  the  effort  was  abandoned  for 
reasons  which  are  fully  discussed  in  Appendix  A. 

A  graphical  method  for  subpopulation  separation  is 
recommended  in  Section  6.2  and  illustrative  examples  are 
given  in  Chapters  7,  8,  and  9. 
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6.2  Parameter  Determination 

rhe  following  steps  provide  a  method  for  separa¬ 
ting  a  mixed  Weibull  population  into  its  constituent  sub¬ 
populations  and  then  of  finding  the  parameters  ^,V T,  and 
for  each  of  the  subpopulations.  The  steps  are: 

1.  Compute  median  ranks  for  the  mixed  failure 
data  and  plot  median  ranks  versus  time  on  Weibull  proba¬ 
bility  paper. 

2.  By  visual  inspection  fit  straight  lines  to  the 
plotted  data  points.  For  the  case  of  early,  chance,  and 
wear out  subpopulations  there  will  be  three  relatively 
straight  line  segments  on  the  Weibull  plot  of  the  mixed 
failure  data.  Points  falling  closest  to  the  lower  line 
are  in  the  early  subpopulation,  points  falling  closest  to 
the  middle  line  are  in  the  chance  subpopu1 at ion,  and 
points  falling  closest  to  the  upper  line  are  in  the  wear- 
out  subpopulation. 

3.  N  is  calculated  as  the  cumulative  number  of 

e 

failures  represented  by  the  points  along  the  early  (lower) 

line,  N  is  calculated  as  the  cumulative  number  of  fail- 
c 

uras  represented  by  the  points  along  the  chance  (middle) 

lint.  N  is  calculated  as  the  cumulative  number  of  fail- 
w 

ures  represented  by  the  points  along  the  wearout  (upper) 


line. 
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4.  Compute  new  median  ranks  for  each  subpopulation 
based  on  sample  sizes  Nfi,  and  N^,  respectively, 

5,  Replot  each  subpopulation  on  individual  Weibull 
paper  and  determine  ,  and  £  for  each  subpopulation  by 
the  method  described  in  Section  3.3. 

Using  the  above  graphical  procedures,  all  of  the 
unknown  quantities  in  equation  (6.1)  may  be  estimated.  The 
method  described  in  this  chapter  will  be  applied  co  raw 
data  in  the  next  three  chapters.  Human  mortality  is  con¬ 
sidered  for  illustrative  purposes  in  Chapter  7  while 
Chapters  8  and  9  deal  with  Klystron  and  Magnetron  failures 
respectively. 


CHAPTER  7 


APPLICATION  OF  THE  RELIABILITY  SUMMATION  MODEL 
TO  HUMAN  MORTALITY 

7.1  The  Data  to  be  Analysed 

If  mortality  is  plotted  versus  time  for  a  large 
sample  of  humans,  the  resulting  curve  provides  a  classic 
example  of  a  mixed  population  exhibiting  infantile,  chance, 
and  deterioration  failures  or,  in  this  case,  deaths. 
Experience  has  shown  that  the  observed  failure  data  of  some 
electrical  and  complex  mechanical  systems  is  similar  to  that 
of  human  mortality.  For  this  reason  and  because  the  data 
are  familiar,  human  mortality  has  been  selected  as  the  sub¬ 
ject  for  this  first  application  of  the  reliability  summation 
model.  A  sample  population  of  1000  Americans  has  been  as¬ 
sumed  and  the  mortality  data  for  this  population  are  given 
in  Table  7.1.  The  sample  data  are  based  upon  survival 
probabilities  given  by  Reference  19.  The  data  of  Table  7.1 
are  grouped  into  twenty,  5-year  class  intervals  and  the 
mortality  and  force  of  mortality  histograms  are  plotted  in 
Figs.  7.1  and  7.2 
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Table  7.1  Mortality  Table  for  1000  Americans 


Age 

At  Class 
Interval 
End 

Point 

Probability  Number 
of  Survival  Dying 
at  Beginning  During 
of  5-Year  5-Year 

Period  Period 

Proba¬ 

bility 

Density 

Function 

Proba¬ 
bility 
of  Death 
at  End 
of  5-Year 
Period 

Force 

of 

Mortal¬ 

ity 

f  (t) 
ecw 

t 

R  (t) 
ecw 

Deaths 

f  (t) 
ecw 

Q  (t) 
xecw 

*ecw(t> 

5 

1.000 

13 

.013 

.013 

.013 

10 

.987 

6 

.006 

.019 

.006 

15 

.981 

6 

.006 

.025 

.006 

20 

.975 

8 

.008 

.033 

.008 

25 

.967 

9 

.009 

.042 

.009 

30 

.958 

10 

.010 

.052 

.012 

35 

.948 

11 

.011 

.063 

.012 

40 

.937 

13 

.013 

.076 

.014 

45 

.924 

19 

.019 

.095 

.021 

50 

.905 

29 

.029 

.124 

.032 

55 

.876 

43 

.043 

.167 

.049 

60 

.833 

63 

.063 

.230 

.076 

65 

.770 

90 

.090 

.320 

.117 

70 

.680 

121 

.121 

.441 

.178 

75 

.559 

146 

.146 

.587 

.261 

80 

.413 

150 

.150 

.737 

.363 

85 

.283 

132 

.132 

.869 

,466 

90 

.131 

84 

.084 

.953 

.640 

95 

.047 

37 

.037 

.990 

.789 

Figure  7. 2  Force  o 
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7 . 2  Parameter  Determination 

The  example  data  are  prepared  for  plotting  by  com¬ 
puting  the  median  rank  for  each  class  interval.  In  this 
case  the  sample  was  so  large  that  the  median  ranks  were 
considered  equal  to  the  cumulative  percent  failing.  The 
median  ranks  and  class  interval  end  points  are  tabulated  in 
Table  7.2  and  the  data  are  plotted  on  Weibull  probability 
paper  in  Fig.  7.3.  The  median  ranks  have  been  plotted  at 
the  class  interval  end  points  rather  than  at  the  mid-points 
as  suggested  by  some  authors,  because  the  failure  data  are 
actually  accumulated  up  to  the  class  interval  end  point  and 
therefore  this  would  seem  to  be  the  more  logical  location 
for  the  median  ranks.  Three  straight  lines  are  fitted 
amongst  the  points  plotted  in  Fig.  7.3,  and  it  is  determined 
by  visual  inspection  that  points  1  and  2  fall  along  the 
line  representing  the  early  subpopulation.  Points  4  through 
9  are  identified  with  the  chance  subpopulation  and  points  11 
through  20  are  placed  in  the  wearout  subpopulation.  Points 
3  and  10  each  fall  close  to  two  lines  and  the  final  deter¬ 
mination  as  to  which  subpopulation  the  questionable  points 
should  be  identified  with  is  not  made  until  after  the  points 
have  been  included  in  both  possible  suopcpulations  and  the 
subpopulations  replotted  on  Weibull  paper.  This  was  done 
and  the  best  straight  line  approximation  occurred  when 
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Table  7.2  Mortality  Data  Prepared  for  Plotting 


Point  Number 

Age 

Median  Rank 

1 

5 

1.3 

2 

10 

1.9 

3 

15 

2,5 

4 

20 

3.3 

r 

25 

4.2 

6 

30 

5.2 

7 

35 

6.3 

8 

40 

7.6 

9 

45 

9.5 

10 

50 

12.4 

11 

55 

16.7 

12 

60 

23.0 

13 

65 

32.0 

14 

70 

44.1 

15 

75 

58.7 

16 

80 

73.7 

17 

85 

86.9 

18 

90 

95.3 

19 

95 

99.0 

20 

100 

100.0 

ENT 
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point  3  was  included  in  the  chance  subpopulation  and  when 
point  10  was  placed  in  the  wearout  subpopulation.  A1 L  of 
the  points  have  now  been  identified  with  a  particular  sub¬ 
population.  The  subpopulation  sizes  are  determined  by  the 
total  failures  represented  by  the  points  which  fall  in  the 


particular  subpopulation.  Thus 

N  -  19 
e 

N  *76 
c 

N  *  905 
w 

New  median  ranks  are  calculated  for  subpopulation  replot¬ 
ting  by  considering  each  of  the  subpopulations  as  an  indi¬ 
vidual  population.  The  new  median  ranks  are  tabulated  in 
Table  7.3  and  the  subpopulations  are  replotted  individually 
in  Figs.  7.4,  7.5,  and  7.6.  From  these  replots,  the  follow¬ 
ing  Weibull  parameters  are  determined: 


Figure  7.4  Ye  “  0 

0e  -  1-60 

7f  -  4.85 
e 

Figure  7.5  V  -  0 

-  2-ei 

7fc  -  35.9 
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Table  7,3  .Mortality  Data  Prepared  for  Subpopulation  Replot 


Point 

Number 


Age 


Subpopu¬ 

lation 

Nf(t) 


Median 

Rank 


l 

t' 


*tor“ 


lift? 


679910 

/  Ohaac* 


2  3  45671 

Subpopulatloa  Rcpl 


9100 

Ot 


75 


figure  7.6  =  0 

K- 7-20 

»,  ■  77.5 

All  parameters  necessary  to  use  equation  (6.1)  have  been 
determined  avid  the  equation  may  now  be  used  to  find  reli¬ 
ability,  or  in  this  case  the  probability  of  living,  for 
all  positive  values  of  t.  For  example,  consider  t  =  30 
years.  Equation  (6.1)  is 
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Q  (30)  -  1  - 
new 


R  (30) 
ecw 


.94  b 


Q  (30)  =  .054 
new 


7. 3  Goodness-of-Fit 

For  the  purpose  of  comparison  the  observed  value  of  Qecw(30) 
from  the  sample  of  1000  is  found  to  be  .052.  The  value  of 
the  absolute  difference  between  the  expected  Q  (30)  and 

6  GW 

the  observed  Q  (30)  is 
new 

D30  =  .054  -  .052  =  .002 

From  the  Kolmogorov- Smirnov  Table  (Table  3.1)  for  N  greater 
than  35,  the  asymptotic  equations  are  used  to  find  the  max¬ 
imum  allowable  value  of  D  at  the  .05  significance  level 


D 


1.36 

N 


1.36 

1000 


=  .043 


To  fully  test  the  proposed  model  for  goodness-of-f it,  it  is 

necessary  to  find  at  each  class  interval  to  insure  that 

no  value  of  is  greater  than  .043.  This  has  been  done  and 

the  values  of  Q  Expected,  Q  observed,  and  D.  are  tabu- 
new  r  ecw  i 

lated  for  each  class  interval  in  Table  7.4.  The  maximum  D. 

l 

is  found  to  be  .007.  Since  .007  is  less  than  the  allowable 
maximum  difference  of  .043  it  is  concluded  that  the  summa¬ 
tion  model  for  three  subpopulations  does  meet  the  criteria 
established  in  Section  5.1  for  a  good  model.  The  expected 
and  the  observed  cumulative  failure  distributions  are 
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Table  7.4  Absolute  Difference  Between  Expected  and 

Observed  Cumulative  Mortality  Data  (Summation 
Model) 


Age 

Observed 

Expected 

Absolute 

Difference 

t 

Q  (t) 
xecw 

Q  (t) 
^ecw 

Di 

5 

.013 

.013 

.000 

10 

.019 

.020 

.001 

15 

.025 

.025 

.000 

20 

.033 

.032 

.001 

25 

.042 

.042 

.000 

30 

.052 

.054 

.002 

35 

.063 

.068 

.005 

40 

.076 

.083 

.007* 

45 

.095 

.010 

.005 

50 

.  124 

.127 

.003 

55 

.167 

.166 

.001 

60 

.230 

.226 

.004 

65 

.320 

.317 

.003 

70 

.441 

.440 

.001 

75 

.537 

.  589 

.002 

80 

.737 

.742 

.005 

85 

.869 

.871 

.002 

90 

.953 

.952 

.001 

95 

.990 

.988 

.002 

100 

1.000 

.998 

.002 

*  Maximum  absolute  difference 

Allowable  absolute  difference  at  the  .05  significance 
level  is  found  from  Table  3.1  to  be 


1.36  *  1.36 

JIT  4 1000 


043 
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plotted  for  visual  comparison  in  Fig.  7.7.  The  excellent 
correlation  between  the  observed  and  the  expected  data 
clearly  indicates  that  the  reliability  summation  model  is 
capable  of  accurately  representing  the  reliability  of  mixed 
populations . 

7 . 4  Analysis  of  Underlying  Failure  Causes-Early  Failures 
In  Chapter  4  it  was  shown  that  the  characteristic 
decreasing  failure  rate  of  the  early  failure  period  may  be 
produced  by  a  combination  of  subpopulations,  none  of  which 
individually  exhibits  a  decreasing  failure  rate.  An  inter¬ 
esting  example  of  the  phenomenon  is  provided  by  the  1958 
Commissioner's  Standard  Ordinary  Mortality  Table  (19).  In 
this  case  each  of  the  three  subpopulations  to  be  combined 
exhibits  a  monotonic  increasing  failure  rate  (force  of 
mortality)  as  indicated  by  the  subpopulation  shape  para¬ 
meters  each  being  greater  than  unity.  Yet,  when  these  sub¬ 
populations  are  combined  into  a  heterogeneous  population 
the  combined  failure  rate  is  initially  decreasing.  This 
phenomenon  may  be  explained  by  considering  the  interaction 
of  the  three  subpopulations.  Table  7.3  indicates  that  the 
early  subpopulation  is  composed  of  only  19  of  the  1000 
individuals  being  considered.  Because  the  early  subpop¬ 
ulation  exhibits  a  very  high  failure  rate  in  contrast  to 
the  useful  and  wearout  subpopulations  it  causes  an  initi¬ 
ally  high  overall  failure  rate  for  the  heterogeneous 
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Flgura  7  *7  not  of  Jtxpaotad  and  Obaarrad  Ouanlatlra  Mortality 
Vara ua  flat  (Suaaatlon  Nodal) 
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population.  During  the  first  5  years  after  birth  13  of  tho 
19  members  of  the  early  subpopulation  die  and,  therefore, 
the  future  impact  of  this  subpopulation  on  the  overall  fail¬ 
ure  rate  is  diminished.  After  ten  years  all  19  members  of 
the  early  subpopulation  have  died  and  the  failure  rate  of 
the  heterogeneous  population  has  decreased  to  its  lowest 
point,  thus  ending  the  early  failure  period. 

A  mathematical  explanation  of  the  decreasing  failure 
rate  phenomenon  during  the  early  portion  of  human  life  is 
provided  by  the  failure  rate  model,  equation  (5.13).  The 
subpopulation  values  for  probability  density  function  and 
reliability  have  been  determined  from  individual  Weibull 
models  for  each  subpopulation. 


Sef,(t)+  |5cf£(t)  +  ffjt) 

+  S“Rw(t) 


(5.13) 


At  t  -  5  years  the  failure  rate  equation  after  multiplying 

and  dividing  by  1000  yields 

^  (5)  .  19  (.650)  +  76  ( .004)  +  905(0) 

19  (.350)  +  76  (.996)  +  905(1) 

The  first  term  in  the  numerator  and  denominator  represents 

that  portion  of  the  heterogeneous  failure  rate  caused  by 

the  early  subpopulation.  The  arithmetic  is  done  in  detail 

below  to  demonstrate  this  effect  at  age  five. 
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*ecw<5> 


12.3  +  .3  +  0 
6.)  +  >5.6  +905 


.013 


Note  that  97  per  cent  of  the  contribution  to  failure  rate 
is  produced  by  the  early  subpopulation  at  age  5  years. 


At  age  10 


A 


ecw 


(10) 


19 ( . 309 )  +  76C.023)  +  905(0) 
19(.041)  +  76( .973)  +  905(1) 


This  reduces  to 


Aecw^®)  “  +  1. 75  -f  Q .  QQg 

.8  +  74  +905 

The  contribution  to  failure  rate  by  the  early  subpopulation 
is  now  only  77  per  cent  and  _he  failure  rate  has  decreased 
from  its  previous  value.  Consider  age  t  ■  15. 

^  (15)  .  19(,039)  +  76(,056)  +  905  (0) 

ec“  19(.002)  +  76(,917)  +  905  (1) 

This  reduces  to 

>  (15)  -  --1— -------  -  .005 

ecw  .038  +  69.7  +905 


The  contribution  of  the  early  subpopulation  is  now  only  15 
per  cent  ar.d  the  trend  in  failure  rate  is  no  longer  decreas¬ 
ing  thus  indicating  that  the  early  failure  period  has  ended. 

Useful  Life  Failures 


Examination  of  Figure  7.2  reveals  that  the  chance 


82 


or  useful  life  failure  period  starts  at  10  years  of  age  and 
extends  to  approximately  50  years  at  which  time  wearoul 
becomes  the  dominant  mode  of  failure.  The  slight  increase 
in  failure  rate  prior  to  50  years  is  attributed  to  the 
interaction  between  wearout  and  chance  failure  mechanisms. 
An  accident  which  causes  the  death  of  an  older  person  might 
only  cause  injury  to  a  younger  individual.  Thus,  even 
though  the  mechanism  of  failure  or  death  is  essentially  of 
the  chance  variety  the  force  of  mortality  is  not  strictly 
independent  of  age  as  it  is  In  the  ideal  case. 

wearout 

The  delineation  between  the  chance  and  wearout 
periods  L*  not  as  distinct  in  Figure  7.2  as  it  is  in  the 
theoretical  curve  of  Figure  1.1,  because  in  the  present 
illustration  both  periods  exhibit  an  increasing  failure 
rate.  However  when  the  cumulative  failures  are  plotted 
on  Weibull  probability  paper,  as  in  Figure  7.3,  the  delinea 
tion  between  the  chance  and  wearout  periods  is  clearly 
shown  to  occur  at  50  years  of  age.  Hisnan  mortality,  of 
course,  is  quite  dissimilar  from  the  failure  of  mechanical 
or  electrical  components.  This  example  has  served  the  pur¬ 
pose  of  placing  failure  phenomena  and  the  reliability 
summation  model  on  a  more  familiar  plane.  It  is  time  new 
to  apply  the  model  to  actual  military  equipment  component 
failure  data. 


CHAPTER  8 


APPLICATION  OF  THE  RELIABILITY  SUMMATION  MODEL 
TO  KLYSTRON  FAILURES 

8.1  The  Data  to  be  Analysed 

In  a  paper  titled,  H High- Power  High-Frequency 
Reliability  Techniques,”  Doyon  and  Siegman  (20)  have  pre¬ 
sented  some  klystron  failure  data  exhibiting  early,  chance, 
and  wearout  failures.  Their  data  for  92  klystrons  tested 
to  failure  are  the  basis  for  this  second  example.  Table 

8.1  is  a  tabulation  of  pertinent  failure  data  for  the  92 
klystrons  in  test.  The  data  have  been  grouped  into  twenty- 
seven  600  hour  class  intervals. 

8 . 2  Parameter  Determination 

The  class  interval  end  points  together  with  their 
respective  median  ranks  are  tabulated  in  Table  8.2  and 
plotted  on  Weibull  probability  paper  in  Fig.  8.1.  Three 
straight  lines  are  drawn  amongst  the  points  and  by  in¬ 
spection  St  is  determined  that  points  1  through  4  are  in 
the  early  subpopulation,  points  5  through  11  are  in  the 
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Table  8.1  Raw  Failure  Data  for  92  Klystrons 


Time 

at  Class 
Interval 
End 

Point 

Re 11 a-  Failures 

bility  at  During 
Beginning  Class 
of  Class  Interval 
Interval 

Proba¬ 

bility 

Density 

Function 

Unre¬ 
liabil¬ 
ity  at 

End  of 

Class 

Interval 

Failure 

Rate 

t 

R  (t) 
ecw 

Failures 

"i  rtr 

ecw 

Q  It) 

ecw 

Aecw^ 

600 

1.000 

23 

.2500 

.250 

.2500 

1200 

.'750 

9 

.0980 

.348 

.1305 

1800 

.652 

2 

.0218 

.370 

.0334 

2400 

.630 

3 

.0326 

.402 

.0517 

3000 

.598 

11 

.1200 

.521 

.2010 

3600 

.479 

6 

.0650 

.587 

.1360 

4200 

.413 

3 

.0326 

.619 

,0790 

4800 

.381 

6 

.0650 

.684 

.1705 

5400 

.316 

4 

.0453 

.728 

.1375 

6000 

.272 

3 

.0326 

.761 

.1200 

6600 

,23S 

4 

.0435 

.804 

.1820 

7200 

.196 

8 

.0870 

.891 

.4440 

7800 

.109 

3 

.0326 

.924 

.2990 

8400 

.076 

0 

.0000 

.924 

.0000 

9000 

.076 

1 

.0109 

.935 

.1436 

9600 

.065 

2 

.0218 

.956 

.3359 

10200 

.044 

2 

.0218 

.980 

.4960 

10800 

.020 

0 

.0000 

.980 

.0000 

11400 

,020 

0 

.0000 

.980 

.0000 

12000 

.020 

0 

.0000 

.980 

.0000 

12600 

.020 

0 

.0000 

.980 

.0000 

13200 

.020 

0 

.0000 

.980 

.0000 

13800 

.020 

0 

.0000 

.980 

.0000 

14400 

.020 

1 

.0109 

990 

.  5450 

15000 

.010 

0 

.0000 

.990 

.0000 

15600 

.010 

0 

.0000 

.990 

.0000 

16200 

.010 

1 

.0109 

1,000 

1.0900 

Table  8.2  Klystron  Failure  Data  Prepared  for  Plotting 


Point 

Number  Age 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 


17 


600 

1200 

1800 

2400 

3000 

3600 

4200 

4800 

5400 

6000 

6600 

7200 

7800 

8400 

9000 

9600 

10200 

10800 

11400 

12000 

12600 

13200 

13800 

14400 

15000 

15600 

16200 


Median 

Rank 


24.6 

34.4 

36.5 

39.7 

51.7 

58.2 

61.4 

68.0 

72.3 

75.5 

79.8 

88.5 
91.7 

92.9 
95.0 
97.2 


98.4 


18 


99.4 
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chance  subpopulation,  and  points  12  through  18  are  in  the 
wearout  subpopulation.  No  importance  is  attached  to  the 
fact  that  the  three  lines  in  this  case  intersect  at  a  com¬ 
mon  point,  as  the  only  purpose  of  the  mixed  population 
plot  is  to  identify  and  separate  the  three  subpopulations. 
By  totaling  the  failures  represented  by  each  of  the  points 
falling  in  a  particular  subpopulation,  the  subpopulation 
sizes  are  found  to  be 


N  -  37 
c 

Nw’18 

New  median  ranks  are  calculated  based  on  these  subpopulation 
sizes  and  these  data  are  tabulated  in  Table  8.3.  The  sub¬ 
populations  are  individually  replotted  in  Figs.  8.2,  8.3, 
and  8,4  and  the  following  Weibull  parameters  are  determined? 

Figure  8.2  j  -  0 

*e  “  .92 
7fe  -  650 

*  -  0 

c 

A  -  2.65 

Kc 

7?c  *  4390 


Figure  8.3 


88 


Figure  8.4 


1  -  0 
w  w 


fw  "  3.0i» 

vl  -  89r 3 

All  parameters  necessary  to  use  equation  (6.1)  have  been 
found.  The  equation  can  now  be  used  to  determine  the  ex¬ 
pected  reliability,  at  a  given  time,  of  a  klystron  randomly 
drawn  from  a  mixed  population  which  is  represented  by  the 
tested  sample.  For  example,  consider  t  =  4800  hours. 


Equation  (6.1)  is 


R  (t) 
ecw 


t-r  ?■ 

n  -<Tr> 

c  ^c 


N 

+  J£  e 

N 


t-jr  /w 

'w 


Substituting  numerical  values  for  all  parameters 


R  (4800) 
ecw 


,4800-Qx*^  r4800-0x  2.65 

-  e“  *  650  '  +  e  4390  ^ 

,4800-0x  3.00 
18  8900  ' 

+  ?2  6 

,402(. 00185)  +  ,402(.282)  +  ,196(.855)  =  .282 


If  the  sample  of  92  is  representative  and  if  equation  (6.1) 
is  valid,  a  klystron  randomly  drawn  at  time  t  =  0  can  be 
expected  to  have  a  28.2%  chance  of  surviving  4800  hours  of 
operation.  Conversely,  the  expected  probability  of  failing 
before  4800  hours  of  operation  is 


Q  (480°)  -  1  -  R  (4800)  -  1  -  .282  =.718 
ecw  ecw 
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Table  8.3  Klystron  Failure  Data  Prepared  for  Subpopula¬ 
tion  Replot 


Point 

Number 


Age 


Subpopu¬ 

lation 


Nf(t) 


Median 

Rank 


Early 

1 

600 

23 

60.7 

Subpopulation 
(N  -  37) 

2 

1200 

32 

84.7 

e 

3 

1800 

34 

90.1 

4 

2400 

37 

98.1 

Chance 

5 

3000 

11 

28.6 

Subpopulation 
(N  -  37) 

6 

3600 

17 

44.6 

c 

7 

4200 

20 

52.6 

8 

4800 

26 

68.8 

9 

5400 

30 

79.4 

10 

6000 

33 

87.5 

11 

6600 

37 

98.1 

Wearout 

12 

7200 

8 

41.8 

Subpopulation 
(N  -  18) 

13 

7800 

11 

58.1 

w 

14 

9000 

12 

63.6 

15 

9600 

14 

74.5 

16 

10200 

16 

85.4 

17 

14400 

17 

90.9 

18 

16200 

18 

96.3 

PERCENT 


WBftUU  KOftAftlUTY  Mm 


ENT 


PERCENT 
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.  !  (loodiU’Ss-nf-Klt 

For  Lhe  purpose  of  comparison,  the  observed  value 
of  Q  (4800)  from  the  tested  sample  of  92  klystrons  is 
.684  (Table  8.1).  The  absolute  difference  between  the  ob¬ 
served  and  the  expected  values  is 

D4800  =  *718  "  *684  “  *034 

From  the  Kolmogorov-Smirnov  Table  (Table  3.1)  for  N  greater 
than  35,  the  asymptotic  equations  are  used  to  find  the  max¬ 
imum  allowable  value  of  D  at  the  .05  significance  level. 

D  -  _  1.36  _ 

4  N  4^2 

To  fully  test  the  proposed  model  for  goodness-of-fit,  it 
is  necessary  to  find  at  each  class  interval  to  insure 
that  no  value  of  is  greater  than  .1415.  This  has  been 

done  and  the  values  of  Q  expected,  Q  observed,  and 

are  tabulated  for  each  class  interval  in  Table  8.4.  The 
maximum  is  found  to  be  .063,  Since  .063  is  less  than  the 
allowable  maximum  difference  of  .1415  it  is  concluded  that 
the  summation  model  for  three  failure  subpopulations 
does  meet  the  criteria  established  in  Section  5.1  for  a  good 
model.  The  expected  and  the  observed  cumulative  fail¬ 
ure  distributions  are  plotted  for  visual  comparison 
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Table  8,4  Absolute  Difference  Between  Expected  and 
Observed  C jmuiative  Klystron  Failures 
(Summation  Model) 


Tiir.e 

Observed 

Expected 

Absolute 

Difference 

t 

~“”q  rn  ~ 

Hecw 

“ ~“Q 

Hecw 

Di 

600 

.250 

.245 

.005 

1200 

.348 

.346 

.002 

1800 

.370 

.408 

.038 

2400 

.402 

.465 

.063* 

3000 

.521 

.525 

.004 

3600 

.587 

.591 

.004 

4200 

.619 

.657 

.038 

4800 

.684 

.718 

.034 

5400 

.728 

.772 

.044 

6000 

.761 

.815 

.054 

6600 

.804 

.848 

.044 

7200 

.891 

.875 

.016 

7800 

.924 

.896 

.028 

8400 

.924 

.914 

.010 

9000 

.935 

.930 

.005 

9600 

.956 

.944 

.012 

10200 

.980 

.956 

.024 

10800 

.980 

.967 

.013 

11400 

.980 

.976 

.004 

12000 

.980 

.983 

.003 

12600 

.980 

.989 

.009 

13200 

.980 

.992 

.012 

13800 

.980 

.995 

.015 

14400 

.990 

.997 

.007 

15000 

.990 

.998 

.003 

15600 

.990 

.999 

.009 

16200 

1.000 

1.000 

0.000 

★Maximum  absolute  difference 


Allowable  absolute  difference  at  the  ,05  significance  level 
is  found  from  Table  3.1  to  be 
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in  Fig,  8.5,  In  this  example  there  is  a  good  correlation 
between  the  observed  anc  the  expected  data  and  it  may  be 
concluded,  as  it  was  in  Chapter  7,  that  the  reliability 
summation  model  with  parameters  determined  by  the  graphical 
method  given  in  Chapter  6,  accurately  models  the  reliability 
of  mixea  failure  mode  populations. 

8 . 4  Analysis  of  Underlying  Failure  Causes-Early  Failures 

In  Fig.  8.1  the  four  points  which  fall  in  the  early 
subpopulation  represent  37  klystron  failures  or  40  per  cent 
of  the  heterogeneous  klystron  population.  This  large  early 
subpopulation  indicates  that  there  are  a  large  number  of 
substandard  items  in  the  mixed  population.  Unfortunately 
the  exact  number  of  substandard  klystrons  cannot  be  deter¬ 
mined  mathematically  because  the  early  subpopulation  is 
composed  of  chance  as  well  as  substandard  failures.  The 
effect  of  this  combination  is  to  produce  a  decreasing 
failure  rate  until  2400  hours  of  operation.  See  Fig.  8,6. 
After  2400  hours  the  klystron  population  is  completely 
burned  in. 

Useful  Life  Failures 

In  Fig.  8.6  the  useful  life  period  which  extends 
from  2400  hours  to  7200  hours  is  seen  to  be  nonmono¬ 
tonic  as  a  result  of  the  interaction  of  the  three  sub- 
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populations  which  each  have  an  impact  on  failure  rate 
during  the  period,  An  additional  49%  of  the  klystron 
population  fails  during  the  useful  life  period  leaving 
only  18  klystrons  to  fail  in  wearout. 

Wearout  Failures 

Although  wearout  begins  after  7200  hours  of  opera¬ 
tion,  the  wearout  failure  rate  does  not  exceed  the  maximum 
useful  life  failure  rate  until  after  9000  hours  of  opera¬ 
tion.  From  Fig.  8.6  it  is  seen  that  20  per  cent  of  klys¬ 
trons  with  greater  than  9600  hours  may  be  expected  to  fail 
within  the  next  600  hours. 

Conclusion  of  Analysis 

1.  The  large  size  of  the  early  subpopulation  sug¬ 
gests  that  improved  quality  control  measures  are  in  order 
for  future  klystron  production  of  the  type  analysed. 

2.  Because  of  the  high  failure  rate  exhibited 
during  useful  life,  a  burn-in  period  of  only  600  hours 
during  early  life  would  be  sufficient  to  reduce  population 
failure  rate  to  that  of  the  useful  life  period.  During 
the  600  hour  bum- in  approximately  20%  of  the  klystrons 
may  be  expected  to  fail  as  compared  to  40%  failures  if  the 
bum-i^  test  is  run  for  the  2400  hour  duration  of  early 
life. 

3.  Operational  reliability  of  klystrons  could  be 
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maintained  at  specified  levels  by  controlling  the  number  of 
hours  the  klystrons  are  permitted  to  operate.  For  example, 
it  might  be  specified  that  all  klystrons  in  operation  must 
have  an  80  per  cent  chance  of  survival  during  any  600  hour 
interval.  In  this  case  a  600  hour  burn- in  prior  to  oper¬ 
ational  employment  of  the  klystrons  should  reduce  the 
population  failure  rate  below  20  per  cent  at  an  estimated 
cost  of  20  per  cent  of  the  total  klystron  population. 

Next,  klystrons  surviving  8400  hours  of  use  could  be  removed 
from  operation  prior  to  failure  at  a  cost  of  an  additional 
8  per  cent  of  the  klystrons.  The  result  is  that  by  selec¬ 
tively  discarding  28  per  cent  of  the  klystron  population 
the  remaining  klystrons  would  have  an  80  per  cent  chance 
of  surviving  during  any  600  hour  mission. 


CHAPTER  9 


APPLICATION  OF  THE  RELIABILITY  SUMMATION  MODEL 
TO  MAGNETRON  FAILURES 

9 . 1  The  Data  to  be  Analysed 

Table  9.1  contains  failure  information  for  a  pop¬ 
ulation  of  38  magnetrons  during  operational  use.  The  in¬ 
formation  was  provided  by  the  U.S.  Army  Missile  Command 
located  at  Redstone  Arsenal,  Alabama.  The  accuracy  and 
the  analytical  power  of  the  proposed  model  will  be  exam¬ 
ined  in  this  chapter.  In  Chapter  10  the  results  obtained 
in  this  chapter  will  be  compared  with  those  obtained  using 
present  reliability  modeling  techniques. 

9.2  Parameter  Determination 

The  median  rank  and  age  at  failure  for  each  failed 
magnetron  is  tabulated  in  Table  9.2  and  plotted  on  Weibull 
probability  paper  in  Figure  9.1.  Three  straight  lines  are 
drawn  amongst  the  points  and  by  inspection  it  is  determined 
that  points  1  through  10  are  in  the  early  subpopulation, 
points  11  through  35  are  in  the  chance  or  useful  life 
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Table  9.1  Raw  Failure  Data  (38  Magnetrons) 


Time  at 

Cumulative 

Cumulative  % 

failure 

Failing  at 

Failing  at 

Time  *  t 

Time  =*  t 

t 

Failures 

Q  (t) 
xecw 

p.o 

1 

75% 

1.0 

2 

.053 

1.5 

3 

.079 

1.7 

4 

.105 

8.3 

5 

.132 

10.0 

6 

.158 

15.0 

7 

.184 

15.5 

8 

.210 

28.0 

9 

.237 

36.7 

10 

.263 

73.6 

11 

.290 

95.0 

12 

.316 

116.0 

13 

.342 

120.0 

14 

.368 

130.0 

15 

.395 

153.5 

16 

.421 

165.0 

17 

.447 

226.4 

18 

.474 

332.3 

19 

.500 

363.0 

20 

.526 

405.4 

21 

.551 

409.0 

22 

.579 

431.0 

23 

.605 

439.0 

24 

.631 

525.0 

25 

.658 

541.7 

26 

.684 

577.9 

27 

.710 

677.0 

28 

.736 

739.0 

29 

.763 

873.0 

30 

.790 

937.0 

31 

,798 

1144.0 

32 

.844 

1169.0 

33 

.849 

1297.0 

34 

.868 

1630.0 

35 

.898 

2088.9 

36 

.924 

2340.9 

37 

.975 

2343.0 

38 

.976 

I 


Tnble  9.2  Magnetron  Failure  Data  Prepared  for  Plotting 


Point  Age  Median 

Number 


Median 

Rank 


1.82 
4.42 
7.04 
9.64 
12.20 


10.0 

15.0 

15.5 

28.0 

36.7 


14.85 

17.45 

20.3 

22.6 

25.3 


73.6 

95.0 

116.0 

120.0 

130.0 


27.9 

30.4 

33.1 

35.6 

38.3 


153.5 

165.0 

226.4 

332.3 

363.0 


40.9 

43.5 
46.1 

48.6 
51.4 


405.4 

409.0 

431.0 

439.0 

525.0 


53.9 

56.5 
59.0 

61.6 
64.4 


541.7 

577.9 

677.0 

739.0 

873.0 


67.0 

69.5 
72.0 

74.6 
77.4 


937.0 

1144.0 

1169.0 

1297.0 

1630.0 

2088.9 

2340.9 
2343.0 


80.0 

82.5 

85.2 
87.7 
90.4 

93.0 

95.6 

98.2 
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subpopulation,  and  points  36  ,  37,  and  38  ara  in  the  wear- 
out  subpopulation.  Since  the  data  ate  not  grouped  each 
point  represents  a  single  failure  and  the  subpopulation 
sizes  are  found  to  be 

N  -  10 
e 

N  *  25 
c 

N  =  3 
w 

New  median  ranks  are  calculated  based  on  these  subpopula¬ 
tion  sizes  and  the  median  ranks  are  tabulated  in  Table  9.3. 
The  subpopulations  are  individually  replotted  in  figures 
9.2,  9.3,  and  9*4  and  the  following  Weibull  parameters  are 

determined: 

Figure  9.2  =  0 

fie  “  0,69 

%  - 

Figure  9.3  "  0 

h  "  ^ 

7f  -  620 

*c 

Figure  9.4  "  0 

A  -  20 

P  w 

m  -  2310 

tw 

All  parameters  necessary  to  use  equation  (6.1)  have  been 
found.  The  equation  can  now  be  used  to  determine  the  ex- 
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Table  9.3  Magnetron  Failure  Data  Prepared  for  Sub¬ 
population  Replot 


Failure 

Number 

Age 

Median 

Rank 

1 

0.0 

6.7 

2 

1.0 

16.3 

3 

1.5 

26.0 

4 

1.7 

35.6 

Early 

5 

8.3 

45.1 

Subpopulation 

6 

10.0 

54.8 

(N  -  10) 

7 

15.0 

64.4 

e 

8 

15,5 

74.0 

9 

28.0 

83.6 

10 

36.7 

93.4 

11 

73.6 

2.8 

12 

95.0 

6.7 

13 

116.0 

10.6 

14 

120.0 

14.5 

15 

130.0 

18.5 

16 

153.5 

22.4 

17 

165.0 

26,4 

18 

226.4 

30.3 

19 

332.3 

34.2 

20 

363.0 

38.2 

Useful  Life 

21 

405.4 

42.1 

Subpopulation 

22 

409.0 

46,0 

(N  -  25) 

23 

431.0 

50.0 

24 

439.0 

5-4.0 

25 

525.0 

58.0 

26 

541.7 

61.9 

27 

5/7.9 

65.7 

28 

677.0 

69.6 

29 

739.0 

73.6 

30 

073.0 

77.5 

31 

937.0 

81.5 

32 

1144.0 

85.5 

33 

1169.0 

89.5 

34 

1297.0 

93.4 

35 

1630.0 

97.3 

Uearout 

36 

2088,9 

20.6 

Subpopulation 

37 

2340.9 

50.0 

<Nw  -  3) 

38 

2343.0 

79.4 

CUMUL  ATIVE  PEW  CENT  FAILURE 
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pected  reliability,  at  a  given  time,  of  a  magnetron  random 
ly  drawn  from  a  mixed  population  which  is  represented  by 
the  observed  sample.  For  example,  consider  t  *  23  hours. 
Substituting  numerical  values  for  all  parameters  into 
equation  (6.1)  yields: 


,28-0 

'“12.1 


) 


.69 


,28-0 

'620 


) 


1.25 


,28-Ox 

"'“2310; 


20 


R 

ecw 


(t) 


e 


=  . 263< . 168)  +  .658( .979)  +  .079(1.000)  -  .767 
If  the  sample  of  38  magnetrons  is  representative  and  if 
equation  (6.1)  is  valid,  a  magnetron  randomly  drawn  at 
time  t  =0  can  be  expected  to  have  a  76.7  per  cent  chance 
of  surviving  28  hours  of  radiate  time.  Conversely,  the 
expected  probability  of  failing  before  28  hours  of  oper¬ 
ation  is 


Q  (28)  -  1  -R  (28)  =  1-.767  ■  .233 
xecw  ecw 

9.3  Goodness-of-Fit 

At  time  t  =  28  hours  the  observed  value  of  unre¬ 
liability,  Q  (28),  from  the  tested  sample  of  38  magne- 

fiCW 

trons  is  .237  (Table  9.1)„  The  absolute  difference  be¬ 
tween  the  observed  and  the  expected  values  of  unreliability 
is 


D28  =  .237  -  .233  =  .004 


From  Lhc  Kolmogorov-Smimov  Table  (Table  3.1)  for  N  greater 
than  35,  the  asymptotic  equations  are  used  to  find  the  max¬ 
imum  allowable  value  of  D  at  the  .05  significance  level. 


1.36  =  1.36 

JT  Jl8 


.220 


To  fully  test  the  proposed  model  for  goodness-of-f it,  it  is 
necessary  to  find  at  each  class  interval  to  insure  that  no 
value  of  is  greater  than  .220.  This  has  been  done  and  the 
values  of  Q  expected,  Q  observed,  and  D.  are  tabulated 
for  each  failure  time  in  Table  9.4.  The  maximum  is  found 
to  be  .070.  Since  .070  is  less  than  the  allowable  maximum 
difference  of  .220  it  is  concluded  that  the  summation  model 
for  three  failure  subpopulations  does  meet  the  criteria  es¬ 
tablished  in  Section  5.1  for  a  good  model.  The  expected  and 
observed  cumulative  failure  distributions  are  plotted  for 
visual  comparison  in  Fig.  9.5.  The  excellent  correlation  be¬ 
tween  the  observed  and  the  expected  data  clearly  indicates 
that  the  reliability  summation  model  is  capable  of  accurate¬ 
ly  representing  the  reliability  of  mixed  populations. 


9 . 4  Analysis  of  Underlying  Failure  O.uses-Early  Failures 
Fig.  9.6  is  a  superposition  of  the  subpopula- 
tion  failure  rates  based  on  the  expected  failures 
from  the  reliability  summation  model.  The  early 
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Table  9.4  Absolute  Difference  Between  Expected  and 
Observed  Cumulative  Magnetron  Failures 
(Summation  Model) 


Time 

Observed 

Expected 

Absolute 

Difference 

t 

Q  (t) 
xecw 

Q  (t) 
ecw 

Di 

0.0 

.026 

.000 

".626  ~ ' 

1.0 

.053 

.043 

.010 

1.5 

.079 

.056 

.023 

1.7 

.105 

.060 

.045 

8.3 

.132 

.144 

.012 

10.0 

.158 

.157 

.001 

15.0 

.184 

.187 

.003 

15.5 

.210 

.189 

.021 

28.0 

.237 

.233 

.004 

i8 . 7 

.263 

.251 

.012 

73.6 

.290 

.299 

.009 

95.0 

.316 

.319 

.003 

116.0 

.342 

.337 

.005 

120.0 

.368 

.340 

.028 

130.0 

.395 

.349 

.046 

153.5 

.421 

.368 

.053 

165.0 

.447 

.377 

.070* 

226.4 

.474 

.426 

.048 

332.3 

.500 

.505 

.005 

363.0 

.526 

.577 

.001 

405.4 

.551 

.556 

.005 

409,0 

.579 

.558 

.021 

431.0 

.605 

.572 

.033 

439.0 

.631 

.577 

.054 

525.0 

.658 

.629 

.029 

541.7 

.684 

.638 

.046 

577.9 

.710 

.658 

.052 

677.0 

.736 

.706 

.030 

739.0 

.763 

.732 

.031 

873.0 

.790 

.779 

.011 

973.0 

.815 

.798 

.017 

1144.0 

.841 

.844 

.003 

1169.0 

.869 

.849 

.020 

1297.0 

.895 

.868 

.027 

1630.0 

.921 

.898 

.023 

2088.9 

.947 

.924 

.023 

(continued) 
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Table  9.4  Continued 


Time 

Observed 

Expected 

Absolute 

Difference 

t 

Q  (t) 

()  (t) 

D. 

ecw 

ecw 

l 

2340.9 

.975 

.975 

.000 

2343,9 

1.000 

.976 

.024 

^Maximum  absolute  difference 

Allowable  absolute  difference  at  the  .05  significance  level 
is  found  from  Table  3.1  to  be 
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Figure  9.5  Plot  of  fixated  and  Obaar^ad  Ouaulativ*  Magnetron 
Failures  Ter sue  Time  14  Ho^rs  (SUaaatlon  Modal) 
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figure  9*6  Plot  of  ^ggoof  Atogaotroa  Sufcpopulatlon  Failure  Rates 
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subpopulation  exhibits  a  monotonic  decreasing  failure  rate. 
This  is  reflected  in  the  heterogeneous  population  as  a 
decreasing  failure  rate  during  the  first  100  hours  of 
operation.  See  Fig.  9.7.  The  non-grouped  data  of  Fig. 

9.1  more  accurately  pinpoints  the  end  of  the  early  failure 
period  at  36.7  hours.  Therefore,  a  required  bum- in  period 
of  approximately  40  hours  as  part  of  the  manufacturer’s 
quality  control  program  would  eliminate  the  early  failure 
period  prior  to  placing  the  magnetrons  into  the  Army  s  ;pv>ly 
system.  I he  advantages  to  the  Army  of  such  an  arrangement 
are  three  fold: 

1.  Radar  down  time  due  to  faulty  magnetrons  wot ' J 
be  greatly  reduced  with  a  subsequent  increase  in  operational 
readiness. 

2.  Fewer  magnetrons  would  be  handled  in  the  supply 
and  maintenance  systems. 

3.  The  number  of  magnetrons  of  this  type  required 
by  the  Army  would  be  reduced  by  approximately  one  fourth,  a 
significant  savings. 

Useful  Life  Failures 

The  useful  life  failure  period  extends  from  40  hours 
to  2000  hours  and  unlike  the  theoretical,  chance  failure 
period  it  is  not  a  purely  constant  failure  rate  period, 


117 


Figure  9.7  shows  the  failure  rate  during  useful  life  to 
vary  from  a  high  of  .14  to  a  low  of  .048  failur  s  per  100 
hours.  This  indicates  that  the  useful  life  failure  mechan¬ 
ism  is  not  strictly  random  but  is  a  combination  of  two  or 
more  failure  mechanisms.  A  simple  explanation  of  this 
dependance  is  provided  by  a  hypothetical  and  highly  simpli¬ 
fied  illustrative  life  test  of  automobile  tires.  During 
the  life  test  those  tires  which  threw  their  tread  are  con¬ 
sidered  substandard;  those  which  fail  as  a  result  of  punc¬ 
tures  are  considered  chance  failures;  and  those  which  fail 
as  a  result  of  tread  wear  are  considered  wearout  failures. 
Theoretically,  the  failure  rate  of  the  tires  undergoing 
the  life  test  is  initially  expected  to  decrease  as  the  high 
failure  rate  substandard  tires  fail  and  are  removed  from 
test.  The  population  is  then  expected  to  exhibit  a  con¬ 
stant  failure  rate  during  useful  life  as  the  sole  failure 
mecha  \i3m  during  this  period  is  random  punctures;  finally, 
as  the  tires  become  bald  the  failure  rate  increases  until 
all  the  remaining  tires  have  failed.  An  actual  test  would 
vary  from  the  theoretical  example  above  during  useful  life 
because  the  puncture  failure  mechanism  is  not,  in  reality, 
independent  of  tire  wear.  As  the  tread  becomes  worn, 
it  loses  thickness  and  increasingly  smaller  objects  will  be 
able  to  penetrate  and  cause  punctures;  therefore,  the 
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puncture  failure  mechanism  is  not  strictly  random  but  is 
also  dependent  upon  tire  wear. 

It  may  be  jpeculated  that  a  similar  type  of  de¬ 
pendence  causes  the  useful  life  of  the  magnetrons  to  exhibit 
a  nonmonotonic  failure  rate.  The  magnetron  case  is  even 
more  complex  than  the  tire  example  because  both  early  find 
wearout  mechanisms  have  an  impact  during  useful  life. 

Wearout  Failures 

Wearout  begins  at  approximately  2000  hours.  The 
wearcut  subpopulation  is  extremely  small  and  only  3  magne¬ 
trons  of  38  failed  during  the  wearout  period. 

Conclusion  of  Analysis 

Approximately  one  fourth  of  the  magnetrons  will 
fail  prior  to  forty  hours  of  operation.  An  obvious  savings 
would  accrue  to  the  Array  in  time,  money,  material,  and 
manpower  if  these  early  failures  could  be  isolated  prior  to 
being  placed  in  supply  and  maintenance  channels  or  into 
radars.  This  could  be  accomplished  by  reauiring  the  manu¬ 
facturer  to  previa  a  forty  hour  burn-in  period  before 
releasing  magnetrons  cor  operational  use. 


CHAPTER  10 


COMPARISON  OF  PRESENT  AND  PROPOSED  MODELS 

10 . 1  Weibull  Model  of  Magnetron  Failures 

In  the  past  it  has  been  expedient  to  model  observed 
failure  rate  data  with  simple  expressions  capable  of  por¬ 
traying  only  monotonic  failure  rates.  Inherent  in  this 
measure  is  the  assumption  that  a  component  population  will 
not  exhibit  both  decreasing  and  increasing  failure  rates 
during  the  span  of  population  life.  In  the  case  of  the 
magnetron  data  presented  in  Chapter  9  both  government  and 
industry  have  accepted  the  Weibull  distribution  with 
?  =  .63,  X”  520,  and  K-  0  as  an  appropriate  model  (Fig¬ 
ure  10.1).  Since  £  is  less  than  unity,  a  monotonic  de¬ 
creasing  failure  rate  is  indicated  during  the  entire 
magnetron  life.  This  state  of  affairs  is  contradictory  tc 
classic  reliability  theory  and  defies  either  practical  or 
theoretical  explanation.  Nevertheless,  the  Weibull  model 
with  the  indicated  parameters  can  be  shown  to  fit  the  ob¬ 
served  failure  data  at  the  .05  significance  level  using 
the  Kolmogorov- Smirnov  Te3t  for  goodness-of-f it  (Table  10.1). 
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Table  10. I  Absolute  Difference  between  Expected  and 
Observed  Cumulative  Magnetron  Failures 
(Weibull  Distribution) 


T  i  me 

Obse rved 

Expected  Ab: 

!)i  1 

O  1  III  V 

f  fere uce 

L 

Q  (L) 

Q(L) 

I). 

l 

0.0 

.026 

“/ooo 

.026 

1.0 

.053 

.019 

.034 

1.5 

.079 

.025 

.054 

1.7 

.105 

.027 

.078 

8.3 

.132 

.071 

.061 

10.0 

.158 

.080 

.078 

15.0 

.184 

.102 

.082 

15.5 

.210 

.104 

,  106* 

28.0 

.237 

.147 

.090 

36.7 

.263 

.172 

.091 

73.6 

.290 

.253 

.037 

95.0 

.316 

.290 

.026 

116.0 

.342 

.322 

.020 

120.0 

.368 

.328 

.040 

130.0 

.395 

.341 

.054 

153.5 

.421 

.371 

.050 

165.0 

.447 

.384 

.063 

226.4 

.474 

.447 

.027 

332.3 

.500 

.530 

.030 

363.0 

.526 

.549 

.023 

405.4 

.551 

.575 

.024 

409.0 

.579 

.577 

.002 

431.0 

.605 

.589 

.016 

439.0 

.631 

.593 

.038 

525.0 

.658 

.634 

.024 

541.7 

.684 

.642 

.042 

577.9 

.710 

.657 

.053 

677.0 

.736 

.693 

.043 

739.0 

.763 

.713 

.050 

“73.0 

,790 

.750 

.040 

037.0 

.815 

.765 

.050 

1 144.0 

.841 

.807 

.034 

1  169.0 

.869 

.811 

.058 

1297.0 

.895 

.831 

.064 

1630.0 

.921 

.871 

.050 

2088 . 9 

.947 

.909 

.038 

2  340.9 

.975 

.924 

.051 

2  34  3.0 

1.000 

.924 

.076 

Maximum 

absolute  difference  . 

106 

Al lowable 
is  found 

absolute  difference 
from  Table  3.i  to  be 

at  the  .05  significance  level 

1.36  1.36  -  . 
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Tims,  l  hr  Weil.nli  distribution  is  (IrmonsL  raled  Lo  hr  ;i 
satisfactory  expression  tor  fitting  a  curve  to  the  observed 
magnetron  failure  date.  However,  the  model  provides  little 
insight  into  the  mechanics  cf  failure  and  ./ould  not  be  of 
great  use  in  either  reliability  improvement  or  theoretical 
speculation  because  of  its  inexplicable  monotonic  decreas¬ 
ing  failure  rate. 

10.2  Comparison  of  Goodness-of-Fit 

Figure  10.2  provides  a  comparison  of  the  expected 
cumulative  failures  as  determined  by  both  the  proposed 
and  the  standard  Weibull  models.  As  might  be  expected, 
because  of  the  larger  number  of  parameters  the  proposed 
mode!  fits  the  observed  data  more  closely  than  does  the 
standard  Weibull  distribution.  While  the  Weibull  data  is 
seen  to  falL  below  tie  observed  cumulative  data  after  500 
hours  of  radiate  time  the  summation  model  closely  ap¬ 
proximates  the  observed  data  throughout  the  entire  span  of 
component  life.  The  maximum  absolute  difference  between 
expected  and  observed  cumulative  failures  is  .070  (Table 
9.4)  for  the  proposed  summation  model  and  .106  (Table  10.1) 
for  the  Weibull  distribution.  The  allowable  maximum 
difference  at  the  .05  significance  level  for  the  Kolmo- 
gorov-Smirnov  gooaness-of-f it  test  is  .220.  This  indicates 
that  both  models  arc  adequate  at  the  .05  significance  level 
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Tor  curve  fitting  purposes,  although  the  proposed  model  is 
the  more  accurate  of  Lhe  two. 

10.3  Comparison  of  Failure  Rate 

At  this  point  one  might  question  the  use  of  a  com¬ 
plex  model  to  gain  unrequired  accuracy  when  the  more  simple 
Weibull  distribution  has  been  shown  to  be  sufficient.  The 
answer  1,2s  in  the  analysis  of  the  failure  rate  curves  of 
the  two  expressions.  The  Weibull  failure  rate  is  monotonic 
decreasing  (Figure  10.3)  which  is  inconsistent  with  relia¬ 
bility  theory.  The  proposed  model  failure  rate  is,  on  the 
other  hand,  nonmonotonic:  first  decreasing,  then  showing  a 
slight  increase  and  decrease,  followed  by  a  dramatic  in¬ 
crease  which  continues  to  the  end  of  component  life  (Figure 
10.3).  Since  the  proposed  model  has  been  shown  to  be  a 
more  accurate  model  of  observed  data  it  is  logical  to  as¬ 
sume  that  the  failure  rate  behavior  described  by  the  sum¬ 
mation  model  is  also  more  accurate.  Since  the  failure  rate 
of  the  summation  model  relates  fairly  well  to  reliability 
theory,  the  model  becomes  useful  for  theoretical  specula¬ 
tion  concerning  failure  cause. 

10.4  Comparison  of  Analytical  Power 

Figure  10.1  is  the  standard  method  of  plotting  fail¬ 
ure  data  and  obtaining  a  straight  line  fit.  The  fact  -hat 


FAt 


1 1 


•  t  i 


>■  i-vij? 


s  .  n 


I  < 


Figure  1Q.3  Weibull  an4 

Rates  Versus  Ti*e  M 


126 


the  straight  line  does  not  fLt  v’ell.  at  the  tails  of  the  data 
is  usually  ignored  or  considered  inconsequent i at  ns  it  was 
in  this  case.  Figure  9.1  is  the-  same  failure  data,  but  this 
time  with  three  straight  lines  fit  to  it.  The  theoretical 
basis  for  the  three  lines  was  established  in  Section  5.3, 

The  advantages  of  using  the  reliability  summation  rule  rath¬ 
er  than  the  conmonly  used  monotonic  failure  rate  distribu¬ 
tions  are: 

1.  The  proposed  expression  models  failure  data 
more  accurately  than  expressions  currently  being  used  for 
this  purpose. 

2.  The  proposed  technique  is  sufficiently  flexible 
to  model  data  that  cannot  be  modeled  by  the  usual  statistical 
methods.  Human  mortality  is  an  example  of  this. 

3.  The  proposed  model  yields  information  about 
failure  cause  which  is  obscured  by  present  methods.  For 
example,  no  wearout  was  evident  in  the  Weibull  plot  of 
magnetron  failures;  whereas,  the  summation  model  indicates 

a  vearout  period  extending  from  2000  to  2500  hours.  Further¬ 
more  the  Weibull  plot  gives  no  indication  of  when  the  burn- 
in  or  early  period  ends  sinc^  the  entire  Weibull  failure 
rate  curve  is  monotonic  decreasing.  The  summation  model, 
on  the  other  hand,  definitely  indicates  that  burn-m  is 
terminated  after  10°  hours  of  operation.  This  is  important 
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information  for  managers  as  well  as  theoreticians  and  it 
is  totally  obscured  by  present  modeling  techniques. 


CHAPTER  11 


SUMMARY  AND  RECOMMEN DAT IONS 


11.1  Summary 

In  this  report  three  unified  expressions  have  been 
considered  to  see  which  best  describes  the  reliability  of 
components  drawn  randomly  from  a  population  of  components 
exhibiting  failures  of  the  early,  chance,  and  wearout  type. 
The  product,  model , 

R  (t)  ~  (t)  R  (t)  '<  (t)  (5.2) 

ecw  a  c  w 

was  eliminated  because  it  indicates  incorrectly  that  the 
unified  reliability  is  zero  after  the  population  is  burned 
in. 

The  combined  summation  and  product  model 
N  N 

R.cv(t)  ■  >T  Re(t>  *  (T  Rc(t>  V°  0.16) 

appears  to  be  theoretically  sour'd  but  no  analytical  r  s 

cf  determining  the  individual  reliabilities  R  (t)  \nd  R  (t) 

c  w 

from  observed  failure  data  was  found  without  combining  th_'n 


128 


129 


Into  the  single  reliability  Rr(t)  =  R  (t)  F  (t).  Unfor- 
tunately,  this  combination  destroys  the  usefulness  of  the 
exprtssion  as  a  tool  for  analysing  the  individual  chance 
and  wearout  failure  modes. 

Less  sound  theoretically  but  perhaps  more  useful 
than  the  combined  model  is  the  summation  model, 

Recw(t)  -  FT  Re(t>  +  T  Rc(t)  +  T  Rv(C)  <5'9> 

It  is  loss  sound  because  its  basis  is  that  there  are  three 
individual  failure  subpopulations,  which  is  not  quite  true. 
Theoretically,  there  are  only  two  failure  subpopulations, 
the  substandard  and  the  good.  The  substandard  population 
providing  early  failures  and  the  good  subpopulation  provid¬ 
ing  both  chance  and  wearout  failures.  This  means  that  wearout 
is  not  a  separate  subpopulaticn  and  components  must  survive 
the  chance  period  in  order  to  deteriorate  and  eventually  be¬ 
come  a  wearout  failure.  If  wearout  was  in  fact  a  separate 
subpopulatton,  components  belonging  to  this  subpopulation  would 
not  be  subjected  to  the  chance  failure  mechanism.  However,  even 
witn  this  theoretical  limitation,  the  summation  model  does 
correlate  well  with  observed  data  and  it  is  the  only  model 
of  the  three  considered  which  provides  specific  information 
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concerning  all  three  failure  modes. 

In  comparison  with  present  modeling  techniques  the 
summation  model  was  shown  to  be  superior  in  three  ways: 

1.  It  is  more  accurate  than  methods  presently  in 


use. 

2.  It's  greater  flexibility  permits  the  modeling 
of  data  which  is  beyond  the  capabilities  of  present  statis¬ 
tical  methods . 

3.  It  yields  important  information  for  use  by 
managers  as  well  as  theoreticians  concerning  failure  periods 
and  underlying  failure  causes  information  which  is  ob¬ 
scured  by  present  modeling  methods. 


11.2  Recommendations 

It  has  been  shown  in  this  thesis  that  the  proposed 
summation  model,  with  graphically  determined  subpopulation 
sizes  and  beibull  parameters,  may  be  used  very  satisfac¬ 
torily  to  represent  observed  failure  data  at  the  .05  sig¬ 
nificance  level  using  the  Kolmogorov- Smirnov  Test.  Further, 
it  has  been  demonstrated  that  the  proposed  model  offers 
significant  advantages  over  present  modeling  techniques. 
Therefore,  it  is  recommended  that  the  reliability  summation 
model  be  adopted  for  use  in  those  cases  in  which  failure 
data  encompass  the  three  failure  periods:  eirly,  chance, 
and  wearout.  This  might  include  complex  systems  such  as 
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tanks  and  mechanical  components  such  as  fuel,  pumps  in 
addition  to  electronic  components. 

The  adoption  of  the  proposed  modeling  technique 
should  result  in  a  better  analysis  of  the  manufacturer's 
quality  control;  it  v/ouid  pinpoint  the  required  burn-in 
period  and  the  time  at  which  wearout  begins;  and  this  know¬ 
ledge,  effectively  applied,  would  reduce  the  number  of  sub¬ 
standard  components  accepted  by  the  military  services  and 
subsequently  placed  into  the  respective  supply  systems. 

With  fewer  unreliable  repair  parts  or  components  available 
in  supply  channels,  maintenance  required  on  end  items  would 
gc  down  while  operational  readiness  would  increase.  In 
the  exanq  le  of  magnetrons  used  by  the  Army  it  was  =hown 
that  approximately  one  quarter  of  the  type  magnetron'  con¬ 
sidered  were  unreliable  and  failed  shortly  after  installa¬ 
tion  into  operational  radars.  Had  the  methods  proposed  in 
this  thesis  been  employed,  an  additional  burn-in  period 
would  have  been  required  of  the  manufacturer  and  the  un¬ 
reliable  magnetrons  would  have  tailed  prior  to  entry  into 
the  Army  supply  system  with  a  resultant  savings  in  money, 
manpower,  storage  and  transportation  requirements,  and  an 
increase  in  end  item  operational  readiness. 
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APPENDIX  A 


ATTEMPTED  EXTENSION  OF  KAO'S  METHOD  FOR 
GRAPHICAL  ESTIMATION  OF  MIXED  WEIBUIL 
PARAMETERS  (18) 

In  order  to  use  equation  (6,1)  to  compute  component 
reliability,  it  is  first  necessary  to  find  the  three  sub¬ 
population  sizes  N  ,  N  ,  and  N  and  the  respective  Weibull 

G  G  V? 

parameters  4-  If  Hf  <v-  K  and7^w<  Kao 
(18)  suggests  a  method  for  finding  certain  of  these  para¬ 
meters  for  the  case  of  two  subpopulations.  An  unsuccessful 
attempt  was  made  to  extend  Kao's  graphical  procedure  to  the 
case  of  three  subpopulations.  The  steps  of  the  attempted 
extended  procedure  are  given,  followed  by  a  discussion  of 
the  problems  which  arise  in  their  use.  The  human  mortality 
data  of  Chapter  7  are  used  to  illustrate  the  procedure.  The 
steps  arc: 

1.  Tabulate  the  median  ranks  of  the  mixed  popula¬ 
tion  for  each  class  interval  (Table  7.2)  and  plot  the  data 
on  Weibull  probability  paper  (Fig.  A.l). 

2.  Fit  three  straight  lines  to  the  plotted  data  by 
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by  visual  inspection  and  designate  the  lines 


f%'  and  ^  q;  (Fig.  A.l). 

N 

3.  Extend  the  ~  line  until  it  intersects  the 
upper  border  line  (99.9%  line).  From  this  intersection 
drop  a  vertical  line,  designated  t^^,  representing  the 
estimated  time  at  which  100  percent  of  the  components  will 
have  failed  (Fig.  A.l). 

N 

4.  Extend  the  ^p  line  upward  until  it  inter¬ 
sects  the  t  line.  The  height  of  this  intersection  read 

N 

on  the  percent  failure  scale  gives  the  estimate  of  ^p  in 


percent  (Fig.  A.l). 


N 


5.  Extend  the  p-  line  upward  until  it  inter¬ 
sects  the  line.  The  height  of  this  intersection  read 


N  +  N 
e  c 


on  the  percent  failure  scale  gives  the  estimate  of  N 

in  percent  (Fig.  A.l). 

N  N  N 

6  C  W 

6.  Since  p-  +  jp  +  »  1  the  three  subpopulation 


N 


N 


sizes  N  4  N  ,  and  N  are  readily  determined. 

€  c  w 

7.  The  location  parameter  for  each  subpopulation 

N  N 

is  determined  by  extending  the  lines  sp  Q* ,  rp  QI  and 

^  W  €  N  v* 

W 

jp  downward  until  they  intersect  the  time  axis  (abscissa). 


The  individual  values  are  the  times  read  at  these  inter- 

N  N 

sections.  Normally,  the  ^  line  and  the  ^p  line  will 
not  intersect  the  positive  time  axis  and  in  this  case 
\  «  t -  °  (Fig.  A.l). 
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8.  Categorize  the  failures  by  relating  them  to 
the  particular  subpopulation  line  upon  which  they  fall. 

If  a  point  falls  on  or  near  both  lines  it  may  be  inclu.ied 
in  both  subpopulations . 

9.  Sufficient  information  has  now  been  deter¬ 
mined  from  the  observed  failure  data  to  separate  the  mixed 
population  into  its  three  constituent  subpopulations. 
Calculate  new  median  ranks  by  considering  the  subpopula¬ 
tions  individually  (Table  A.l).  Plot  the  median  ranks 
versus  the  individual  t- %  values  with  8  as  determined  in 
Step  7  (Fig.  A. 2). 

10.  Fit  three  straight  lines  to  the  subpopulation 

N  N  N 

plots  and  designate  the  lines  ~  Qc»  and  > 

respectively  (Fig.  A. 2). 

11.  Find  the  individual  values  of  end  $  for  each 
subpopulation  using  the  method  described  in  Section  3.3 
(Fig.  A. 2) . 

The  following  parameters  are  now  determined: 

N 


-  .062, 

<e  -  0, 

P  -  .65, 77  -  49 
*  e  'e 

a 

• 

*— > 

03 

00 

<c  '  °<  1 

?c  "  2-5,  7TC  -  57 

-  .750, 

-  20- 

K  ■  - 54 
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Table  A. 1  Mortality  Data  Prepared  for  Subpopulation 
Replot  (Extended  Method) 


Point 

Number 

# 

Age  at 
Class 
Inter¬ 
val 

End 

Point 

t 

Subpop¬ 

ulation 

Values 

for 

t-r 

Subpop¬ 

ulation 

Cumula¬ 

tive 

Failures 

Subpop¬ 

ulation 

Median 

Ranks 

M.R. 

Early 

Subpopulation 

1 

5 

5 

13 

20.3 

(Nfi-62,%-0 

2 

10 

10 

19 

30.0 

Chance 

3 

15 

15 

6 

3.0 

Subpopulation 

4 

20 

20 

14 

7.3 

(N  -188, <-o; 

5 

25 

25 

23 

12.0 

c  c 

6 

30 

30 

33 

17.4 

7 

35 

35 

44 

23.2 

8 

40 

40 

57 

30.1 

9 

45 

45 

76 

40.1 

Wearout 

10 

50 

30 

29 

3.8 

Subpopulation 

11 

55 

35 

72 

9.6 

(N  -750,  <  -20) 

12 

60 

40 

135 

18.0 

w  w 

13 

65 

45 

225 

30.0 

14 

70 

50 

346 

44.0 

15 

75 

55 

492 

65.5 

16 

80 

60 

642 

85.6 

17 

85 

65 

774 

18 

90 

70 

858 

19 

95 

75 

895 

20 

100 

80 

505 
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Table  A. 2  is  a  tabulation  of  the  observed  0  , 

ecw 

the  expected  Q  ,  and  the  absolute  difference  D.  com- 

GCW 

puted  at  each  class  interval  end  point.  The  maximum  value 
of  is  .124  which  is  greater  than  the  maximum  allowable 
of  .043  at  the  .05  significance  level.  It  can  be  concluded, 
therefore,  that  the  parameters  found  by  the  extended  method 
do  not  describe  the  observed  data  at  the  .05  significance 
level. 

Two  problems  associated  with  the  extended  method 
have  been  isolated: 

1.  There  is  no  apparent  justification  for  using 
the  y  parameter  as  a  delay  factor  rather  than  for  curva¬ 
ture  correction  a3  suggested  by  most  writers  in  this  area. 
Furthermore,  the  determination  of  the  ^parameter  from  the 
mixed  plot  rather  than  from  the  subpopulation  replots 
unnecessarily  limits  the  flexibility  of  the  Weibull  equa¬ 
tions.  This  criticism  pertains  to  both  the  original  and 
the  extended  Kao  procedures* 

2.  In  the  extended  procedure  the  approximation 
of  the  subpopulation  sites  is  quite  inaccurate.  This  re¬ 
sults  in  inaccurate  calculation  of  subpopulation  median 
ranks  which  leads  directly  to  inaccurate  scale  and  shape 
parameters.  In  addition,  points  which  indicated  median 
ranks  greater  than  100  percent  were  lost  for  plotting 
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Table  A. 2  Absolute  Difference  Between  Expected  and 

Observed  Cumulative  Mortality  Data  (Expected 
Data  Determined  by  the  Extended  Method) 


Age 

Observed 

Expected 

Absolute 

Difference 

t 

Q  (t) 

^ecw 

Q  (t) 
xecw 

Di 

5 

.013 

.013 

.000 

10 

.019 

,021 

.002 

15 

.025 

.029 

.004 

2b 

.033 

.040 

.007 

25 

.042 

.052 

.010 

30 

.052 

.066 

.014 

35 

.063 

.083 

.020 

40 

.076 

.102 

.026 

45 

.095 

.126 

.031 

50 

.124 

.159 

,035 

55 

.167 

.209 

.042 

60 

.230 

.287 

.057 

65 

.320 

.401 

.081 

70 

.441 

.  5*  0 

.109 

75 

.587 

.,'11 

.124* 

80 

.737 

.849 

.112 

85 

.869 

.935 

.066 

90 

.953 

.970 

.017 

95 

.990 

.981 

.009 

00 

1.00C 

.984 

.016 

*Maxitmxn  absolute  difference 


Allowable  absolute  difference  at  the  .05  sigr.if ' cance 
level  is  found  from  Table  3.1  to  be: 


1.36 


m 


043 


1 
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Table  A. 3  Comparison  of  Weibull  Parameters  Determined  ■ 

By  Two  Different  Methods  j 

3 

* 

—  - - - - -  -  —  i— i - 

Extended  Method  Method  of  Chapter  7  .f 


■  j 


N 

e 

a 

62 

N 

e 

a 

19 

♦ 

N 

c 

a 

188 

H 

c 

a 

76 

N 

w 

a 

750 

N 

w 

a 

905 

Vc 

* 

0 

t. 

a 

0 

3fc 

a 

0 

X  c 

a 

0 

r 

j 

»w 

a 

20 

a 

0 

a 

49 

a 

4.85 

j 

j 

% 

a 

57 

»c 

a 

35.9 

\ 

1 

a 

54 

*v 

a 

77.5 

i 

1 

a 

.65 

J»e 

a 

1.6 

11 

\ 

1. 

K 

2.5 

/c 

a 

2.81 

j 

f 

- 

«w 

a 

5.9 

J»v 

a 

7.2 

; 

i 

l 


1 
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purposes, 
meters  as 
suggested 


Table  A. 3  provides  a  comparison  of  the  para- 
determined  by  the  extended  method  and  the  method 
in  Chapter  6. 


APPENDIX  B 


BASIC  LANGUAGE  COMPUTER  PROGRAMS  FOR  APPLYING  THE  PROPOSED 
RELIABILITY  SUMMATION  MODEL  WITH  KNOWN  THEORIZED,  OR  GRAPH' 
ICALLY  ESTIMATED  PARAMETERS. 


(The  programs  included  in  this  Appendix  are  samples  of  the 
programs  used  to  analyze  magnetron  failures  in  Chapter  9 
and  would  require  modification  to  be  of  more  general  use.) 


t 


14? 
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10  READ  N,E,B1,N1,C,B2,G2,N2,W,B3,G3,N3 

11  LET  T-100 

15  PRINT  N 

16  PRINT  E,B1,G1,N1 

17  PRINT  C,B2,G2,N2 

18  PRINT  W,B3,G3,N3 

19  PRINT  "38  MAGNETRON  FAILURES  GROUPED  DATA  100  HR  CLASS 

INTERVAL" 

20  PRINT 

22  LET  <*1-0 

30  LET  AI-<T-G1)/N1 

35  IF  A1>0  THEN  40 

36  LET  Al-0 

40  LET  A2-(T-G2)/N2 

45  IF  A2>0  THEN  50 

46  LET  A2-0 

50  LET  A3-(T-G3)/N3 

55  IF  A3>0  THEN  60 

56  LET  A3-0 

60  LET  A4-AltBl 
70  LET  A5»A2tB2 
80  LET  A6-A3tB3 
82  LET  R8-1-91 
90  LET  R1-EXP(-A4) 

100  LET  R2“EXP(-A5) 

110  LET  R3-EXP(-A6) 

120  LET  R4-(E/N)*R1 
130  LET  R5-(C/N)*R2 
140  LET  R6-(W/N)*R3 
150  LET  R7-R4+R5+R6 
160  LET  Q1-1-R7 
162  LET  F1-R8-R7 
164  LET  L1-F1/R8 

169  PRINT 

170  PRINT  "AGE»"T 

180  PRINT  "R-"R8,"PDF«"F1,"9-"91 

181  PRINT  "FR-"L1 

133  PRINT  "RE-"R1,"RC-"R2,"RW-"R3 

190  LET  T-T+100 

200  IF  T>2500  THEN  900 

210  GO  TO  30 

220  DATA  38, 10,. 69, 0,12. 1,25, 1.25,0, 620, 3, 20, 0,2310 
900  END 


144 


10  READ  N,E,Bl,Gl,Nl,C,B2,G2,N2,W,B3fG3,N3 

15  PRINT  N 

16  PRINT  E,B1,G1,N1 

17  PRINT  C,B2,G2,N2 

19  PRINT  " 38^MAGNETR0N  FAILURES  UNGROUPED  DATA" 

20  PRINT 

21  DIM  T(40) 

22  FOR  1*1  TO  38 

23  READ  T(I) 

30  LET  A1«(T(I)-G1)/N1 

35  IF  A1>0  THEN  40 

36  LET  Al-0 

40  LET  A2-(T(I)-G2)/N2 

45  IF  A2>0  THEN  50 

46  LET  A2-0 

50  LET  A3-(T(I)-G3)/N3 

55  IF  A3>0  THEN  60 

56  LET  A3-0 

60  LET  A4-Al*Bl 
70  LET  A5-A2tB2 
80  LET  A6-A3tB3 
82  LET  R8-1-Q1 
90  LET  R1-EXP(-A4) 

100  LET  R2«EXP(-A5) 

110  LET  R3»EXP(-A6) 

120  LET  R4-(E/N)*R1 
130  LET  R5-(C/N)*R2 
140  LET  R6-(W/N)*R3 
150  LET  R7-R4+R5+R6 
160  LET  Q1-1-R7 
162  LET  F1-R8-R7 
164  LET  L1-F1/R8 

169  PRINT 

170  PRINT  »AGE-"T(I) 

180  PRINT  "R-"R8,"PDF-"F1,"Q-"Q1 

185  PRINT  "RE»"R1 » "RC-"R2 ,  ’,RW«"R3 
190  NEXT  I 

nl  MW3M0, .  69, 0, 12.  i,  25, 1.25, 0.620  ^3. 20,0,2310 

300  DATA  0,1.1.5,1.7,;3.  0,15.  -5,28.36.  ^  -  433 

330  DATA  1630,2088.9,2340.9,2343 
900  END 
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